meta-llama / llama3Links

The official Meta Llama 3 GitHub site

☆28,867

Alternatives and similar repositories for llama3

Users that are interested in llama3 are comparing it to the libraries listed below

Sorting:

meta-llama / llama
Inference code for Llama models
☆58,577Updated 6 months ago
meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…
☆17,686Updated this week
mistralai / mistral-inference
Official inference library for Mistral models
☆10,387Updated 4 months ago
meta-llama / codellama
Inference code for CodeLlama models
☆16,353Updated 11 months ago
QwenLM / Qwen3
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
☆23,510Updated last week
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆53,703Updated this week
huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
☆19,184Updated this week
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆18,656Updated this week
unslothai / unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
☆42,983Updated this week
tatsu-lab / stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,098Updated last year
karpathy / llama2.c
Inference Llama 2 in one file of pure C
☆18,597Updated 11 months ago
jzhang38 / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆8,667Updated last year
karpathy / nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆43,305Updated 7 months ago
karpathy / minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
☆9,786Updated last year
haotian-liu / LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆23,180Updated 11 months ago
QwenLM / Qwen2.5-VL
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆11,866Updated 2 months ago
google-deepmind / gemma
Gemma open-weight LLM library, from Google DeepMind
☆3,569Updated this week
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,583Updated last year
meta-llama / PurpleLlama
Set of tools to assess and improve LLM security.
☆3,646Updated last week
hiyouga / LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
☆55,008Updated last week
openai / tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
☆15,290Updated 4 months ago
meta-llama / llama-models
Utilities intended for use with Llama models.
☆7,170Updated 2 weeks ago
naklecha / llama3-from-scratch
llama3 implementation one matrix multiplication at a time
☆15,069Updated last year
huggingface / trl
Train transformer language models with reinforcement learning.
☆14,863Updated this week
google / gemma_pytorch
The official PyTorch implementation of Google's Gemma models
☆5,518Updated 2 months ago
sgl-project / sglang
SGLang is a fast serving framework for large language models and vision language models.
☆16,386Updated this week
microsoft / LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
☆12,443Updated 7 months ago
QwenLM / Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
☆18,912Updated last week
deepseek-ai / FlashMLA
FlashMLA: Efficient MLA decoding kernels
☆11,657Updated 3 months ago
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆38,929Updated 2 months ago