what does that mean? i said that we didn't see him doing the move before many models were finished training. so these models literally cannot know that this happened.
fair, if u wanna see it that way, ai is bad... just like many other technologies which are being used to do bad stuffs.
yes, ai used for bad is bad. yes, guns used for bad is bad. yes, computers used for bad - is bad.
guns are specifically made to hurt people and kill them, so that's kinda a different thing, but ai is not like this. it was not made to kill or hurt people. currently, it is made to "assist the user". And if the owners of the LLMs (large language models) are pro-elon, they might train in the idea that he is okay actually.
but we can do that too! many people finetune open models to respond in "uncensored" ways. So that there is no gate between what it can and can't say.
LLMs (large language models) are not some oracle, which magically knows all the latest news. however: if you activate search functionality, it will look up things online about it, likely find some article about it, and recognize that reality has moved on since 2023.
the training process being shiddy i completely agree with. that is simply awful and takes a shidload of resources to get a good model.
but... running them... feels oki to me.
as long as you're not running some bigphucker model like GPT4o to do something a smoler model could also do, i feel it kinda is okay.
32B parameter size models are getting really, really good, so the inference (running) costs and energy consumption is already going down dramatically when not using the big models provided by BigEvilCo™.
Models can clearly be used for cool stuff. Classifying texts is the obvious example. Having humans go through that is insane and cost-ineffective. Meanwhile models can classify multiple pages of text in half a second with a 14B parameter (8GB) model.
obviously using bigphucker models for everything is bad. optimizing tasks to work on small models, even at 3B sizes, is just more cost-effective, so i think the general vibe will go towards that direction.
people running their models locally to do some stuff will make companies realize they don't need to pay 15€ per 1.000.000 tokens to OpenAI for their o1 model for everything. they will realize that paying like 50 cents for smaller models works just fine.
if i didn't understand ur point, please point it out. i'm not that good at picking up on stuff..
i kno! i'm already running a smol llama model on the phone, and yeaaaa that's a 2 token per second speed and it makes the phone lag like crazy... but it works!
currently i'm doing this with termux and ollama, but if there's some better foss way to run it, i'd be totally happy to use that instead <3
apparently not. it seems they are refering to the official bs deepseek ui for ur phone. running it on your phone fr is super cool! Imma try that out now - with the smol 1.5B model
heyyyyyy... maybe don use this aggressive word... maybemaybs... - i jus don like it for som reason...
also, nuuuuu don just say cutie like that!!! >
< That's like - im used to reading it in English but not in ~german...also also, who else did u find here who u would call that word an who is also from the grrrmn land?