pytorch / torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

☆3,599

Alternatives and similar repositories for torchchat

Users who are interested in torchchat are comparing it to the libraries listed below.

pytorch / torchtune
PyTorch native post-training library
☆5,366 · Updated this week
mistralai / mistral-finetune
☆2,990 · Updated 10 months ago
pytorch / torchtitan
A PyTorch native platform for training generative AI models
☆4,125 · Updated this week
pytorch / executorch
On-device AI across mobile, embedded and edge for PyTorch
☆3,092 · Updated this week
huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
☆3,032 · Updated this week
meta-llama / llama-stack-apps
Agentic components of the Llama Stack APIs
☆4,267 · Updated 3 months ago
KellerJordan / modded-nanogpt
NanoGPT (124M) in 3 minutes
☆2,892 · Updated 2 weeks ago
meta-llama / llama-stack
Composable building blocks to build Llama Apps
☆7,937 · Updated this week
AnswerDotAI / fsdp_qlora
Training LLMs with QLoRA + FSDP
☆1,524 · Updated 8 months ago
linkedin / Liger-Kernel
Efficient Triton Kernels for LLM Training
☆5,419 · Updated last week
karpathy / nano-llama31
nanoGPT style version of Llama 3.1
☆1,409 · Updated 11 months ago
allenai / OLMo
Modeling, training, eval, and inference code for OLMo
☆5,822 · Updated last week
Lightning-AI / LitServe
The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.
☆3,427 · Updated this week
facebookresearch / MobileLLM
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024).
☆1,313 · Updated 3 months ago
pytorch / ao
PyTorch native quantization and sparsity for training and inference
☆2,219 · Updated this week
arcee-ai / mergekit
Tools for merging pretrained large language models.
☆6,122 · Updated this week
EurekaLabsAI / ngram
The n-gram Language Model
☆1,437 · Updated 11 months ago
Lightning-AI / lightning-thunder
PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…
☆1,384 · Updated this week
mlfoundations / dclm
DataComp for Language Models
☆1,342 · Updated 4 months ago
ridgerchu / matmulfreellm
Implementation for MatMul-free LM.
☆3,018 · Updated last week
cohere-ai / cohere-toolkit
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
☆3,068 · Updated last week
EricLBuehler / mistral.rs
Blazingly fast LLM inference.
☆5,923 · Updated this week
facebookresearch / lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
☆4,669 · Updated 2 weeks ago
likejazz / llama3.np
llama3.np is a pure NumPy implementation for Llama 3 model.
☆987 · Updated 3 months ago
pytorch-labs / gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
☆6,036 · Updated 3 months ago
google / gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
☆6,514 · Updated this week
predibase / lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,333 · Updated 2 months ago
openai / simple-evals
☆3,886 · Updated 3 weeks ago
google-ai-edge / model-explorer
A modern model graph visualizer and debugger
☆1,288 · Updated last week
huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆2,068 · Updated 3 weeks ago