pytorch / torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
☆3,383Updated this week
Related projects
Alternatives and complementary repositories for torchchat
- Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.☆2,489Updated this week
- PyTorch native finetuning library☆4,336Updated this week
- ☆2,746Updated 2 months ago
- Agentic components of the Llama Stack APIs☆3,894Updated this week
- Blazingly fast LLM inference.☆4,472Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!☆3,256Updated 3 months ago
- On-device AI across mobile, embedded and edge for PyTorch☆2,191Updated this week
- Composable building blocks to build Llama Apps☆4,594Updated this week
- A native PyTorch Library for large model training☆2,623Updated this week
- PyTorch native quantization and sparsity for training and inference☆1,585Updated this week
- Efficient Triton Kernels for LLM Training☆3,454Updated this week
- nanoGPT style version of Llama 3.1☆1,246Updated 3 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆6,127Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,227Updated last week
- A MLX port of FLUX based on the Huggingface Diffusers implementation.☆988Updated this week
- DataComp for Language Models☆1,157Updated this week
- High-quality datasets, tools, and concepts for LLM fine-tuning.☆2,010Updated 3 weeks ago
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆3,906Updated last month
- Local realtime voice AI☆1,946Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆3,322Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆4,768Updated 2 weeks ago
- Deploy your agentic workflows to production☆1,834Updated this week
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,155Updated 2 weeks ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆2,571Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆2,830Updated this week
- Parse files for optimal RAG☆3,173Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆10,734Updated last week
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆3,540Updated 2 weeks ago
- LLM-powered multiagent persona simulation for imagination enhancement and business insights.☆3,677Updated last week
- The easiest way to use Agentic RAG in any enterprise☆3,866Updated this week