Posts: 4 · Comments: 617 · Joined: 2 yr. ago

  • Well, math is a thing.
    You can deduce basic logic and build on it up to complex structures.
    There are quite a few worlds to explore in that realm that are built on nothing but basic logic.

  • Nvidia, like every GPU manufacturer, has quite a bit of competition in AI from specialized chips.

    E.g. Cerebras - https://www.forbes.com/sites/craigsmith/2024/08/27/cerebras-speeds-ai-by-putting-entire-foundation-model-on-its-giant-chip/

    I don't think Nvidia will keep its dominance in that area for much longer, especially when specialized chips can be run much more cheaply and with lower electricity costs.

    Edit: a more up-to-date article, with an actual working model running on their chip:
    https://cerebras.ai/press-release/cerebras-launches-worlds-fastest-deepseek-r1-distill-llama-70b-inference

  • How does one look cool with that shit keyboard and headset?

  • Thing is, there are loads of sites where I only want to read an article or two. Paying for a subscription to all of them just isn't feasible.

    By now I've even forgotten the name of the project, but there was this idea of paying the actual creator of the article I'm reading.
    And I really liked the idea. But as far as I know, that project died - and messily, if I remember correctly.

    But the idea is still good imho.
    I'd have no problem chipping in a bit when an article is well written and informative. But I don't want to buy the cow when I only want a sip of milk.

  • Yeah, error reports seem to be some esoteric concept for some people...

    I'm obviously the one doing the IT stuff for my family (although in recent years I got out of it by saying I only do Linux). So once I got a hysterical call that the laptop was dead and nothing was working anymore.
    So stupid me rushed over, just to find out that they simply couldn't receive emails, because their provider had a problem.

    In my book, if a laptop is dead, there isn't anything on the screen anymore - best case some BIOS stuff is happening.

    I've started switching people over to Linux, at least the next generation and the people close to me, and I now decline most things on Windows (or smartphones), because I officially haven't worked with those operating systems for years. All I'd do is search around and read things they could have read themselves if they weren't too lazy.
    I'm all for helping people who are lost and need help, but then they mustn't treat me like a fucking employee. I'm here to help you solve your problems yourself next time, otherwise my work here has no point.

  • Looking at the error reports and the user input logs, a rabbit couldn't do worse...

  • At least in Austria it's even more stupid.
    Immigrants from outside the EU have a really hard time even being allowed to work legally.
    And then people complain that they're only here to receive social benefits - or younger people sometimes get into the illegal drug trade, because they also want to join their Austrian friends in partying or having a smartphone.
    What the hell else should they do when they're not allowed to work?
    Fucking hypocrites...

    I hired an Indian immigrant with a completed PhD (earned in Austria) at my company, because otherwise he'd have been forced to leave the country.
    There were quite a few hurdles to get over before he could even start, because you need to justify why you're giving the job to a foreigner, and they also need a fairly high starting salary, otherwise they aren't allowed to stay.
    It seems to depend on the country of origin and on their immigration status, but at the very least it's not easy for them to even get a job.

  • Yeah, can recommend that one too. Although it sometimes seems to have performance problems with a large number of files - could be that it's already fixed, though.

  • But then the fault is obviously DEI, because it made the birds gay or something...

  • As I said, the architectural changes are quite cool

    As far as I've understood, it mostly comes down to splitting the model up into multiple expert systems, so you don't need to activate the complete system for every request.

    But I've only scratched the surface...

    Also, open source... The weights are publicly available.
    None of the training data or training systems are.

    Edit: regarding "open source":
    Meta's Llama is also on huggingface, just like deepseek. I still wouldn't talk about transparency here.
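
The "multiple experts, only some of them active per request" idea mentioned above can be sketched roughly like this. It's a toy illustration of top-k expert routing in plain Python, not DeepSeek's actual implementation: the class name `ToyMoE`, the random weights, and the "expert = per-dimension scaling" stand-in are all assumptions made up for the example.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class ToyMoE:
    """Hypothetical toy mixture-of-experts layer with top-k routing."""

    def __init__(self, num_experts, dim, top_k, seed=0):
        rng = random.Random(seed)
        self.top_k = top_k
        # Router: one weight vector per expert, used to score the input.
        self.router = [[rng.uniform(-1, 1) for _ in range(dim)]
                       for _ in range(num_experts)]
        # Each "expert" is just a per-dimension scaling here; in a real
        # model it would be a full feed-forward block.
        self.experts = [[rng.uniform(-1, 1) for _ in range(dim)]
                        for _ in range(num_experts)]
        # Count how often each expert actually runs.
        self.calls = [0] * num_experts

    def forward(self, x):
        # Score every expert (cheap), then run only the top_k best (expensive part).
        scores = [sum(w * xi for w, xi in zip(row, x)) for row in self.router]
        chosen = sorted(range(len(scores)), key=lambda i: scores[i],
                        reverse=True)[:self.top_k]
        gates = softmax([scores[i] for i in chosen])

        out = [0.0] * len(x)
        for gate, idx in zip(gates, chosen):
            self.calls[idx] += 1
            expert_out = [w * xi for w, xi in zip(self.experts[idx], x)]
            out = [o + gate * e for o, e in zip(out, expert_out)]
        return out, chosen


moe = ToyMoE(num_experts=8, dim=4, top_k=2)
output, used = moe.forward([0.5, -1.0, 0.25, 2.0])
print("experts used:", used)          # 2 of the 8 experts ran
print("calls per expert:", moe.calls)  # the other 6 stay at 0
```

The point of the design is that the router is cheap while the experts are expensive, so with `top_k=2` out of 8 experts only a fraction of the layer's parameters does any work for a given input.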

  • Distilling OpenAI and Llama models probably also helped quite a bit.

    Although I must admit that the architectural changes are pretty cool.

    But I have to add that I only started reading into the topic a few weeks ago and don't have any real practical experience, besides checking out some huggingface docs I got linked yesterday - and stupid me hadn't thought about looking there...
    So everything I say is probably bullshit o⁠:⁠-⁠)

  • Python is not a problem
    SW Dev is my job. Just never had real contact with AI before, besides playing around a bit.

    Thank you very much for the link!!

    Edit: thank you very much again, that was pretty much exactly what I was looking for.
    Don't know how I missed checking out huggingface. I always thought of it as just a github for models and didn't bother checking for docs...
    But that's a great intro with simple tools/tutorials to get a grip on it, thanks!

  • I'd like to look into that. How can I train an existing model further?

    So far I'm only playing around with ollama, but I'd like to do a bit more - mostly just to satisfy my need to understand things - and have no idea where to start.

  • What else should he do with papers? Read them?!

  • Families? Oo

  • Ah, yeah, thanks!
    That was it!