Distilling step-by-step: Outperforming larger language models with less training data and smaller model sizes
What is your favorite offline LLM for technical utility, and have you noticed anything unexpected about certain models?
Is there a good reason why AMD APUs just aren't used with massive amounts of (V)RAM like the Mac M2 is?
Hugging Face Releases IDEFICS: An Open-Access 80B Visual Language Model Replicating DeepMind's Flamingo