jondurbin/qlora

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jondurbin/qlora)

jondurbin / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

☆79

Alternatives and similar repositories for qlora

Users that are interested in qlora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jondurbin / bagel
View on GitHub
A bagel, with everything.
☆326Apr 11, 2024Updated 2 years ago
jondurbin / airoboros
View on GitHub
Customizable implementation of the self-instruct paper.
☆1,051Mar 7, 2024Updated 2 years ago
severian42 / MoA-LMStudio-Chat
View on GitHub
This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…
☆12Jun 25, 2024Updated 2 years ago
emrgnt-cmplxty / zero-shot-replication
View on GitHub
☆75Sep 5, 2023Updated 2 years ago
zarakiquemparte / zaraki-tools
View on GitHub
☆28Aug 30, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
ChrisHayduk / qlora-multi-gpu
View on GitHub
QLoRA with Enhanced Multi GPU Support
☆38Aug 8, 2023Updated 2 years ago
emrgnt-cmplxty / SmolTrainer
View on GitHub
☆21Oct 6, 2023Updated 2 years ago
teknium1 / stanford_alpaca-replit
View on GitHub
Modified Stanford-Alpaca Trainer for Training Replit's Code Model
☆47Jun 1, 2023Updated 3 years ago
allenai / easy-to-hard-generalization
View on GitHub
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Jan 17, 2024Updated 2 years ago
arielnlee / Platypus
View on GitHub
Code for fine-tuning Platypus fam LLMs using LoRA
☆625Feb 4, 2024Updated 2 years ago
Danmoreng / llm-pen
View on GitHub
☆16Feb 21, 2026Updated 5 months ago
kaiokendev / cutoff-len-is-context-len
View on GitHub
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Jun 21, 2023Updated 3 years ago
mzbac / qlora-fine-tune
View on GitHub
☆166Jun 1, 2023Updated 3 years ago
FartyPants / Training_PRO
View on GitHub
Traing PRO extension for oobabooga WebUI - recent dev version
☆53Aug 7, 2025Updated 11 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
teknium1 / GPTeacher
View on GitHub
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
☆1,668Sep 15, 2023Updated 2 years ago
Gryphe / BlockMerge_Gradient
View on GitHub
Merge Transformers language models by use of gradient parameters.
☆215Aug 8, 2024Updated last year
jquesnelle / yarn
View on GitHub
YaRN: Efficient Context Window Extension of Large Language Models
☆1,740Apr 17, 2024Updated 2 years ago
mettamind-ai / physics_of_llms
View on GitHub
Các thí nghiệm liên quan tới LLMs cho tiếng Việt (insprised by Physics of LLMs Series)
☆11Oct 21, 2024Updated last year
CoffeeVampir3 / ez-trainer
View on GitHub
Train Llama Loras Easily
☆30Aug 3, 2023Updated 2 years ago
Glavin001 / Expertise-by-AI
View on GitHub
Learn & build: Always available expertise powered by AI
☆14Jul 10, 2023Updated 3 years ago
SALT-NLP / demonstrated-feedback
View on GitHub
☆131Oct 1, 2024Updated last year
terryncew / openline-core
View on GitHub
OpenLine Protocol (OLP) — typed claim/evidence graphs for agentic systems with a 5-number digest. One-command FastAPI demo; auditable, re…
☆15Feb 14, 2026Updated 5 months ago
euclaise / SlimTrainer
View on GitHub
Full finetuning of large language models without large memory requirements
☆92Sep 22, 2025Updated 10 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
swj0419 / detect-pretrain-code-contamination
View on GitHub
☆78Dec 26, 2023Updated 2 years ago
broskicodes / chess-position-embeddings
View on GitHub
code for training and using chess embeddings models
☆14Jun 9, 2024Updated 2 years ago
wuhy68 / Parameter-Efficient-MoE
View on GitHub
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)
☆145Sep 20, 2024Updated last year
dust-tt / llama-ssp
View on GitHub
Experiments on speculative sampling with Llama models
☆129Jun 8, 2023Updated 3 years ago
AnswerDotAI / fsdp_qlora
View on GitHub
Training LLMs with QLoRA + FSDP
☆1,550Nov 9, 2024Updated last year
Re-Align / AlignTDS
View on GitHub
Analyzing LLM Alignment via Token distribution shift
☆17Jan 26, 2024Updated 2 years ago
PavAI-Research / pavai-c3po
View on GitHub
reimagine the implementation of C-3PO droid voice synthesizer and multilingual translation and communication capabilities with the latest…
☆12Mar 6, 2024Updated 2 years ago
fanqiwan / FuseAI
View on GitHub
FuseAI Project
☆600Jan 25, 2025Updated last year
FartyPants / VirtualLora
View on GitHub
extension for text WebUI
☆20Aug 7, 2025Updated 11 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
turboderp / exllama
View on GitHub
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆2,934Sep 30, 2023Updated 2 years ago
lachlansneff / sparsellama
View on GitHub
☆40Mar 25, 2023Updated 3 years ago
kyleliang919 / Long-context-transformers
View on GitHub
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆116Mar 22, 2023Updated 3 years ago
catid / dataloader
View on GitHub
High-performance tokenized language data-loader for Python C++ extension
☆15Jul 22, 2024Updated 2 years ago
MainAIdk / AI-short-generator
View on GitHub
AI shorts generator
☆26Apr 18, 2024Updated 2 years ago
FartyPants / Playground
View on GitHub
Text WebUI extension to add clever Notebooks to Chat mode
☆148Aug 7, 2025Updated 11 months ago
limenlp / safer-instruct
View on GitHub
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Feb 22, 2024Updated 2 years ago