pytorch / torchchatLinks
Run PyTorch LLMs locally on servers, desktop and mobile
☆3,624Updated 4 months ago
Alternatives and similar repositories for torchchat
Users that are interested in torchchat are comparing it to the libraries listed below
Sorting:
- ☆3,071Updated 2 months ago
- PyTorch native post-training library☆5,660Updated this week
- On-device AI across mobile, embedded and edge for PyTorch☆4,226Updated this week
- A PyTorch native platform for training generative AI models☆5,023Updated last week
- NanoGPT (124M) in 2 minutes☆4,589Updated last week
- PyTorch native quantization and sparsity for training and inference☆2,657Updated last week
- A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.☆3,797Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,753Updated 6 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,406Updated 9 months ago
- llama3.np is a pure NumPy implementation for Llama 3 model.☆991Updated 9 months ago
- Everything about the SmolLM and SmolVLM family of models☆3,594Updated 3 weeks ago
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…☆1,439Updated this week
- Training LLMs with QLoRA + FSDP☆1,537Updated last year
- CoreNet: A library for training deep neural networks☆7,016Updated 3 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,180Updated 5 months ago
- Agentic components of the Llama Stack APIs☆4,289Updated 6 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆987Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,154Updated this week
- Minimalistic large language model 3D-parallelism training☆2,529Updated last month
- Efficient Triton Kernels for LLM Training☆6,123Updated this week
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆2,083Updated last year
- Official inference library for pre-processing of Mistral models☆849Updated last week
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi…☆3,660Updated last week
- Tools for merging pretrained large language models.☆6,761Updated last week
- DataComp for Language Models☆1,413Updated 4 months ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,840Updated this week
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆3,119Updated 8 months ago
- Implementation for MatMul-free LM.☆3,052Updated 2 months ago
- 4M: Massively Multimodal Masked Modeling☆1,789Updated 8 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,325Updated last year