pytorch / torchtune
PyTorch native finetuning library
☆4,267Updated this week
Related projects ⓘ
Alternatives and complementary repositories for torchtune
- Tools for merging pretrained large language models.☆4,788Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆5,919Updated this week
- A native PyTorch Library for large model training☆2,566Updated this week
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,647Updated 3 weeks ago
- A framework for few-shot evaluation of language models.☆6,904Updated this week
- Go ahead and axolotl questions☆7,858Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,350Updated this week
- High-quality datasets, tools, and concepts for LLM fine-tuning.☆1,965Updated 2 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆10,590Updated this week
- Robust recipes to align language models with human and AI preferences☆4,663Updated last month
- ☆4,030Updated 5 months ago
- ☆2,732Updated last month
- Accessible large language models via k-bit quantization for PyTorch.☆6,244Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,026Updated last week
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration☆2,498Updated 3 weeks ago
- Efficient Triton Kernels for LLM Training☆3,382Updated this week
- PyTorch native quantization and sparsity for training and inference☆1,541Updated this week
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆4,593Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,178Updated this week
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection☆1,426Updated last week
- Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom dataset…☆14,903Updated this week
- Training LLMs with QLoRA + FSDP☆1,419Updated this week
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain…☆8,604Updated this week
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,745Updated 9 months ago
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆1,797Updated last week
- The official PyTorch implementation of Google's Gemma models☆5,283Updated 3 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆7,835Updated 6 months ago
- Train transformer language models with reinforcement learning.☆9,967Updated this week
- Finetune Llama 3.2, Mistral, Phi, Qwen & Gemma LLMs 2-5x faster with 80% less memory☆17,884Updated this week
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.☆1,823Updated 3 months ago