π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
β20,865Mar 30, 2026Updated this week
Alternatives and similar repositories for peft
Users that are interested in peft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train transformer language models with reinforcement learning.β17,863Updated this week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"β13,383Dec 17, 2024Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β41,925Mar 26, 2026Updated last week
- Fast and memory-efficient exact attentionβ23,062Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)β69,375Updated this week
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A high-throughput and memory-efficient inference and serving engine for LLMsβ74,805Updated this week
- Instruct-tune LLaMA on consumer hardwareβ18,954Jul 29, 2024Updated last year
- Accessible large language models via k-bit quantization for PyTorch.β8,092Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,861Jun 10, 2024Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,459Jun 2, 2025Updated 10 months ago
- Code and documentation to train Stanford's Alpaca models, and generate the data.β30,264Jul 17, 2024Updated last year
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,587Mar 23, 2026Updated last week
- Inference code for Llama modelsβ59,275Jan 26, 2025Updated last year
- π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal modelβ¦β158,637Updated this week
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Ongoing research training transformer models at scaleβ15,900Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.β24,626Aug 12, 2024Updated last year
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ22,069Jan 23, 2026Updated 2 months ago
- Large Language Model Text Generation Inferenceβ10,815Mar 21, 2026Updated 2 weeks ago
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.β33,224Updated this week
- Aligning pretrained language models with instruction data generated by themselves.β4,588Mar 27, 2023Updated 3 years ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parametersβ5,934Mar 14, 2024Updated 2 years ago
- The agent engineering platformβ131,360Mar 28, 2026Updated last week
- A framework for few-shot evaluation of language models.β11,915Mar 18, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- LlamaIndex is the leading document agent and OCR platformβ48,180Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction.β10,392Updated this week
- Making large AI models cheaper, faster and more accessibleβ41,372Updated this week
- LAVIS - A One-stop Library for Language-Vision Intelligenceβ11,192Nov 18, 2024Updated last year
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an imageβ33,027Mar 25, 2026Updated last week
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)β4,744Jan 8, 2024Updated 2 years ago
- BELLE: Be Everyone's Large Language model EngineοΌεΌζΊδΈζε―Ήθ―倧樑εοΌβ8,287Oct 16, 2024Updated last year
- Example models using DeepSpeedβ6,813Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models.β25,041Mar 28, 2026Updated last week
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.β58,639Updated this week
- Retrieval and Retrieval-augmented LLMsβ11,479Mar 27, 2026Updated last week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,202Sep 30, 2025Updated 6 months ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)β9,273Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMsβ20,286Mar 28, 2026Updated last week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We alsβ¦β18,275Updated this week
- Latest Advances on Multimodal Large Language Modelsβ17,534Mar 20, 2026Updated 2 weeks ago