(Deleted for not relevant anymore)
llama2.c: Inference Llama 2 in one file of pure C by Andrej Karpathy
Leaked GPT-4 Architecture: Demystifying Its Impact & The 'Mixture of Experts' Explained (with code)
Dolphin (based on Llama 1) released by Eric Hartford!
chargoddard's frankensteined 22B llama2
My attempt at explaining group size and act order simply (but definitely not briefly)
My attempt to explain groupsize and act order in GPTQ
Llama-2, Mo’ Lora (proof of concept MOE of LoRAs)
Llama-2 FOSAI & LLM Roundup Series! (Summer 2023 Edition)
What have you been up to recently with your local LLMs?
Llama 2: Full Breakdown
Llama 2 - Meta AI
Retentive Network: A Successor to Transformer for Large Language Models
Finally got my shit together and made git repos of my docker images
llamacpp has added custom RoPE (#2054) · ggerganov/llama.cpp@6e7cca4
LocalGPT Web crawler?
Difference between GGML & GPTQ
Mark Zuckerberg & Meta to Release Commercial Version of its AI/LLM (LLaMA) In Effort to Catch Rivals
Open-Orca/OpenOrca-Preview1-13B · Hugging Face