Why Reddit?
Phind V7 subjectively performing at GPT-4 levels for coding
Min P sampler (an alternative to Top-K/Top-P) has been merged into llama.cpp
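The Min P sampler named in the title above keeps only tokens whose probability is at least some fraction of the most likely token's probability, rather than using a fixed cutoff like Top-K or Top-P. A minimal sketch of that idea (function name and details are illustrative assumptions, not the llama.cpp implementation):

```python
import math

def min_p_filter(logits, min_p=0.1):
    """Illustrative Min P filtering: keep token indices whose probability
    is at least min_p times the probability of the most likely token."""
    # Softmax over the logits (numerically stabilized).
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Threshold scales with the top token's probability.
    threshold = min_p * max(probs)
    return [i for i, p in enumerate(probs) if p >= threshold]
```

With a confident distribution the threshold is high and few tokens survive; with a flat distribution the threshold drops and more candidates stay in play, which is the sampler's selling point over a fixed Top-K.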
HUGE dataset released for open source use
I've started uploading quants of exllama v2 models, taking requests
Nearly 10% of people ask AI chatbots for explicit content. Will it lead LLMs astray?
Text Generation Web-UI has been updated to CUDA 12.1, and with it new docker images are needed
Single Digit tokenization improves LLM math abilities by up to 70x
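The single-digit tokenization result above concerns splitting multi-digit numbers so each digit is its own token instead of a merged chunk like "1234". A toy preprocessing sketch of that idea (the regex approach and function name are assumptions; the actual method may operate at the tokenizer-vocabulary level):

```python
import re

def split_digits(text):
    """Insert a space between every pair of adjacent digits so a
    whitespace tokenizer sees each digit separately (illustrative only)."""
    return re.sub(r"(?<=\d)(?=\d)", " ", text)
```

For example, "1234 + 56" becomes "1 2 3 4 + 5 6", giving the model a consistent per-digit representation of numbers.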
Musical notation
Are Local LLMs Useful in Incident Response? - SANS Internet Storm Center
Dolphin 2.0 based on mistral-7b released by Eric Hartford
Beginner questions thread
Mistral 7B model
Microsoft's latest LLM agent framework: AutoGen
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
Effective Long-Context Scaling of Foundation Models | Research - AI at Meta
Jeremy Howard: A Hackers' Guide to Language Models
Amazon investing in Anthropic - Expanding access to safer AI with Amazon
Very interesting thread about reversal knowledge
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding
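The paper above accelerates decoding by having a cheap draft model propose several tokens that the full model then verifies. A toy greedy sketch of plain speculative decoding (not the paper's self-speculative layer-skipping variant; `draft_next`/`verify_next` are stand-ins for real models):

```python
def speculative_decode(draft_next, verify_next, prompt, k=4, steps=8):
    """Toy greedy speculative decoding: the draft proposes k tokens,
    the full model checks them one by one; matching tokens are accepted,
    the first mismatch is replaced by the full model's token and the
    rest of the draft is discarded."""
    tokens = list(prompt)
    for _ in range(steps):
        # Draft phase: cheaply propose k tokens.
        ctx = list(tokens)
        draft = []
        for _ in range(k):
            t = draft_next(ctx)
            draft.append(t)
            ctx.append(t)
        # Verify phase: the full model re-predicts each position.
        for t in draft:
            v = verify_next(tokens)
            if v == t:
                tokens.append(t)      # accepted: output unchanged vs. full model
            else:
                tokens.append(v)      # correct the mismatch, drop the rest
                break
    return tokens
```

Because every emitted token is what the full model would have produced, the output is lossless; the speedup comes from accepting several drafted tokens per full-model pass.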