pytorch / torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
☆3,503Updated last week
Alternatives and similar repositories for torchchat:
Users that are interested in torchchat are comparing it to the libraries listed below
- PyTorch native post-training library☆4,846Updated this week
- Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.☆2,843Updated this week
- ☆2,850Updated 5 months ago
- NanoGPT (124M) in 3 minutes☆2,278Updated this week
- On-device AI across mobile, embedded and edge for PyTorch☆2,517Updated this week
- nanoGPT style version of Llama 3.1☆1,313Updated 6 months ago
- A PyTorch native library for large model training☆3,313Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆10,095Updated this week
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆7,506Updated last week
- Efficient Triton Kernels for LLM Training☆4,433Updated this week
- Everything about the SmolLM2 and SmolVLM family of models☆1,880Updated 2 weeks ago
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains☆4,179Updated 3 weeks ago
- Tools for merging pretrained large language models.☆5,260Updated last week
- Composable building blocks to build Llama Apps☆7,256Updated this week
- Agentic components of the Llama Stack APIs☆4,140Updated this week
- PyTorch native quantization and sparsity for training and inference☆1,842Updated this week
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,523Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆2,809Updated 3 months ago
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,434Updated 3 weeks ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,250Updated this week
- A vector search SQLite extension that runs anywhere!☆4,862Updated 3 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,592Updated this week
- The most advanced AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆4,880Updated this week
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library☆2,562Updated this week
- Large Concept Models: Language modeling in a sentence representation space☆1,927Updated 3 weeks ago
- Knowledge Agents and Management in the Cloud☆3,701Updated last week
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi…☆2,736Updated this week
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆3,939Updated last week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆5,996Updated last month
- DataComp for Language Models☆1,230Updated 2 months ago