karpathy / llama2.cLinks

Inference Llama 2 in one file of pure C

☆18,820

Alternatives and similar repositories for llama2.c

Users that are interested in llama2.c are comparing it to the libraries listed below

Sorting:

ggml-org / ggml
Tensor library for machine learning
☆13,261Updated this week
ggml-org / llama.cpp
LLM inference in C/C++
☆87,385Updated this week
karpathy / llm.c
LLM training in simple, raw C/CUDA
☆27,769Updated 3 months ago
karpathy / nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆44,924Updated 10 months ago
jzhang38 / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆8,768Updated last year
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆21,471Updated this week
huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆10,566Updated 3 weeks ago
mit-han-lab / streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,057Updated last year
nlpxucan / WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
☆9,453Updated 4 months ago
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,680Updated last year
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,965Updated last year
abetlen / llama-cpp-python
Python bindings for llama.cpp
☆9,647Updated 2 months ago
openai / tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
☆16,161Updated last week
karpathy / minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
☆9,987Updated last year
bigscience-workshop / petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
☆9,804Updated last year
meta-llama / llama
Inference code for Llama models
☆58,807Updated 8 months ago
google / gemma_pytorch
The official PyTorch implementation of Google's Gemma models
☆5,557Updated 4 months ago
openlm-research / open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,524Updated 2 years ago
SJTU-IPADS / PowerInfer
High-speed Large Language Model Serving for Local Deployment
☆8,363Updated 2 months ago
ml-explore / mlx
MLX: An array framework for Apple silicon
☆22,437Updated this week
meta-pytorch / gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
☆6,121Updated last month
FMInference / FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,362Updated 11 months ago
BlinkDL / RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆14,009Updated this week
ShishirPatil / gorilla
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
☆12,459Updated last week
tatsu-lab / stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,170Updated last year
google / gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
☆6,587Updated last week
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆39,141Updated 4 months ago
Lightning-AI / lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,074Updated 3 months ago
meta-llama / llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…
☆17,938Updated this week
meta-llama / codellama
Inference code for CodeLlama models
☆16,366Updated last year