Skip Navigation

InitialsDiceBearhttps://github.com/dicebear/dicebearhttps://creativecommons.org/publicdomain/zero/1.0/„Initials” (https://github.com/dicebear/dicebear) by „DiceBear”, licensed under „CC0 1.0” (https://creativecommons.org/publicdomain/zero/1.0/)BL
Posts
45
Comments
9
Joined
2 yr. ago

Technology @lemmy.ml

HyperTech News Report #0003 - Expanding Horizons

Technology @lemmy.world

HyperTech News Report #0003 - Expanding Horizons

Technology @lemmy.ml

HyperTech News Report #0002 - A New Challenger Approaches!

Technology @lemmy.world

HyperTech News Report #0002 - A New Challenger Approaches!

Technology @lemmy.ml

HyperTech News Report #0001 - Happy FOSAI Friday!

Technology @lemmy.world

HyperTech News Report #0001 - Happy FOSAI Friday!

World News @lemmy.ml

CodeLlama-34B - the First Open-Source Model Beating GPT-4 on HumanEvals

Technology @lemmy.ml

Beating GPT-4 on HumanEval with a Fine-Tuned CodeLlama-34B

Technology @lemmy.world

Beating GPT-4 on HumanEval with a Fine-Tuned CodeLlama-34B

Technology @lemmy.world

Introducing Stable-Diffusion.cpp (Inference in Pure C/C++)

LocalLLaMA @sh.itjust.works

Cheetor - A New Multi-Modal LLM Strategy Empowered by Controllable Knowledge Re-Injection

Technology @lemmy.ml

Incognito Pilot: The Next-Gen AI Code Interpreter for Sensitive Data

Technology @lemmy.world

Incognito Pilot: The Next-Gen AI Code Interpreter for Sensitive Data

  • I used to feel the same way until I found some very interesting performance results from 3B and 7B parameter models.

    Granted, it wasn’t anything I’d deploy to production - but using the smaller models to prototype quick ideas is great before having to rent a gpu and spend time working with the bigger models.

    Give a few models a try! You might be pleasantly surprised. There’s plenty to choose from too. You will get wildly different results depending on your use case and prompting approach.

    Let us know if you end up finding one you like! I think it is only a matter of time before we’re running 40B+ parameters at home (casually).

  • Technology @lemmy.ml

    Vicuna v1.5 Has Been Released!

    LocalLLaMA @sh.itjust.works

    Vicuna v1.5 Has Been Released!

    Technology @lemmy.world

    Vicuna v1.5 Has Been Released!

  • I am actively testing this out. It's hard to say at the moment. There's a lot to figure out deploying a model into a live environment, but I think there's real value in using them for technical tasks - especially as models mature and improve over time.

    At the moment, though, performance is closer to GPT 3.5 than GPT 4, but I wouldn't be surprised if this is no longer the case within the next year or so.

  • After finally having a chance to test some of the new Llama-2 models, I think you're right. There's still some work to be done to get them tuned up... I'm going to dust off some of my notes and get a new index of those other popular gen-1 models out there later this week.

    I'm very curious to try out some of these docker images, too. Thanks for sharing those! I'll check them when I can. I could also make a post about them if you feel like featuring some of your work. Just let me know!

  • LocalLLaMA @sh.itjust.works

    Free Open-Source AI LLM Guide

    Technology @lemmy.ml

    Free Open-Source AI LLM Guide

    Selfhosted @lemmy.world

    Free Open-Source AI LLM Guide

    Technology @lemmy.world

    Free Open-Source AI LLM Guide

  • OpenAI has launched a new initiative, Superalignment, aimed at guiding and controlling ultra-intelligent AI systems. Recognizing the imminent arrival of AI that surpasses human intellect, the project will dedicate significant resources to ensure these advanced systems act in accordance with human intent. It's a crucial step in managing the transformative and potentially dangerous impact of superintelligent AI.

    I like to think this starts to explore interesting philosophical questions like human intent, consciousness, and the projection of will into systems that are far beyond our capabilities in raw processing power and input/output. What may happen from this intended alignment is yet to be seen, but I think we can all agree the last thing we want in these emerging intelligent machines is to do things we don't want them to do.

    'Superalignment' is OpenAI's response in how to put up these safeguards. Whether or not this is the best method is to be determined.