pytorch / torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
☆3,530 · Updated this week
Alternatives and similar repositories for torchchat:
Users who are interested in torchchat are comparing it to the libraries listed below.
- A PyTorch native library for large model training ☆3,470 · Updated this week
- NanoGPT (124M) in 3 minutes ☆2,403 · Updated this week
- On-device AI across mobile, embedded and edge for PyTorch ☆2,618 · Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,481 · Updated last month
- PyTorch native quantization and sparsity for training and inference ☆1,913 · Updated this week
- Everything about the SmolLM2 and SmolVLM family of models ☆2,035 · Updated last week
- PyTorch native post-training library ☆5,014 · Updated this week
- Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale. ☆2,996 · Updated this week
- nanoGPT style version of Llama 3.1 ☆1,341 · Updated 7 months ago
- Composable building blocks to build Llama Apps ☆7,506 · Updated this week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,266 · Updated last month
- Blazingly fast LLM inference. ☆5,240 · Updated this week
- Implementation for MatMul-free LM. ☆2,969 · Updated 4 months ago
- Large Concept Models: Language modeling in a sentence representation space ☆2,053 · Updated last month
- ☆2,889 · Updated 6 months ago
- The official PyTorch implementation of Google's Gemma models ☆5,397 · Updated this week
- lightweight, standalone C++ inference engine for Google's Gemma models. ☆6,288 · Updated this week
- Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors a… ☆1,310 · Updated this week
- Agentic components of the Llama Stack APIs ☆4,174 · Updated this week
- Official repository for our work on micro-budget training of large-scale diffusion models. ☆1,356 · Updated 2 months ago
- 🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide… ☆1,247 · Updated last month
- Training LLMs with QLoRA + FSDP ☆1,460 · Updated 4 months ago
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains ☆4,200 · Updated last month
- DataComp for Language Models ☆1,263 · Updated this week
- Sky-T1: Train your own O1 preview model within $450 ☆3,142 · Updated last week
- Schedule-Free Optimization in PyTorch ☆2,116 · Updated 3 weeks ago
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent… ☆2,604 · Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee… ☆2,861 · Updated 4 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆2,802 · Updated 2 weeks ago