huggingface / peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
★20,099 · Updated this week
Alternatives and similar repositories for peft
Users who are interested in peft compare it to the libraries listed below.
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" ★12,975 · Updated 11 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs ★10,766 · Updated last year
- Train transformer language models with reinforcement learning. ★16,382 · Updated this week
- Fast and memory-efficient exact attention ★20,669 · Updated this week
- Accessible large language models via k-bit quantization for PyTorch. ★7,767 · Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ★24,008 · Updated last year
- Retrieval and Retrieval-augmented LLMs ★10,887 · Updated last month
- Large Language Model Text Generation Inference ★10,664 · Updated last week
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ★9,307 · Updated this week
- A framework for few-shot evaluation of language models. ★10,706 · Updated last week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ★62,973 · Updated this week
- Aligning pretrained language models with instruction data generated by themselves. ★4,527 · Updated 2 years ago
- Ongoing research training transformer models at scale ★14,301 · Updated this week
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters ★5,922 · Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs ★63,667 · Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction. ★10,117 · Updated last week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ★39,263 · Updated 5 months ago
- Code and documentation to train Stanford's Alpaca models, and generate the data. ★30,226 · Updated last year
- An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All. ★8,487 · Updated this week
- Instruct-tune LLaMA on consumer hardware ★18,981 · Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ★18,047 · Updated 3 weeks ago
- Inference code for Llama models ★58,934 · Updated 10 months ago
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm. ★4,989 · Updated 7 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ★40,803 · Updated this week
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (… ★11,247 · Updated this week
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks ★7,128 · Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ★8,808 · Updated last year
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs. ★7,286 · Updated this week
- Tools for merging pretrained large language models. ★6,468 · Updated 3 weeks ago
- Latest Advances on Multimodal Large Language Models ★16,732 · Updated 2 weeks ago