jondurbin/airoboros

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jondurbin/airoboros)

jondurbin / airoboros

Customizable implementation of the self-instruct paper.

☆1,051

Alternatives and similar repositories for airoboros

Users that are interested in airoboros are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jondurbin / bagel
View on GitHub
A bagel, with everything.
☆326Apr 11, 2024Updated 2 years ago
axolotl-ai-cloud / axolotl
View on GitHub
Go ahead and axolotl questions
☆12,247Updated this week
turboderp / exllama
View on GitHub
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆2,934Sep 30, 2023Updated 2 years ago
jondurbin / qlora
View on GitHub
QLoRA: Efficient Finetuning of Quantized LLMs
☆79Apr 10, 2024Updated 2 years ago
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,261Jun 17, 2026Updated last month
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
AblateIt / finetune-study
View on GitHub
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Sep 10, 2023Updated 2 years ago
Gryphe / BlockMerge_Gradient
View on GitHub
Merge Transformers language models by use of gradient parameters.
☆215Aug 8, 2024Updated last year
turboderp-org / exllamav2
View on GitHub
A fast inference library for running LLMs locally on modern consumer-class GPUs
☆4,593Mar 4, 2026Updated 4 months ago
VikParuchuri / textbook_quality
View on GitHub
Generate textbook-quality synthetic LLM pretraining data
☆508Oct 19, 2023Updated 2 years ago
nlpxucan / WizardLM
View on GitHub
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
☆9,480Jun 7, 2025Updated last year
teknium1 / GPTeacher
View on GitHub
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
☆1,668Sep 15, 2023Updated 2 years ago
jquesnelle / yarn
View on GitHub
YaRN: Efficient Context Window Extension of Large Language Models
☆1,740Apr 17, 2024Updated 2 years ago
MeetKai / functionary
View on GitHub
Chat language model that can use tools and interpret the results
☆1,596Jun 30, 2026Updated 3 weeks ago
dphnAI / sonar
View on GitHub
Large-scale LLM inference engine
☆1,810Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
SkunkworksAI / hydra-moe
View on GitHub
☆416Nov 2, 2023Updated 2 years ago
argilla-io / distilabel
View on GitHub
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆3,344Updated this week
huggingface / alignment-handbook
View on GitHub
Robust recipes to align language models with human and AI preferences
☆5,645May 26, 2026Updated 2 months ago
marella / ctransformers
View on GitHub
Python bindings for the Transformer models implemented in C/C++ using GGML library.
☆1,884Jan 28, 2024Updated 2 years ago
eugenepentland / landmark-attention-qlora
View on GitHub
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Jun 16, 2023Updated 3 years ago
arielnlee / Platypus
View on GitHub
Code for fine-tuning Platypus fam LLMs using LoRA
☆625Feb 4, 2024Updated 2 years ago
jeffrey-fong / Invoker
View on GitHub
The one who calls upon functions - Function-Calling Language Model
☆36Oct 2, 2023Updated 2 years ago
QuixiAI / laserRMT
View on GitHub
This is our own implementation of 'Layer Selective Rank Reduction'
☆240May 26, 2024Updated 2 years ago
emrgnt-cmplxty / zero-shot-replication
View on GitHub
☆75Sep 5, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
artidoro / qlora
View on GitHub
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,968Jun 10, 2024Updated 2 years ago
AutoGPTQ / AutoGPTQ
View on GitHub
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
☆5,075Apr 11, 2025Updated last year
huggingface / text-generation-inference
View on GitHub
Large Language Model Text Generation Inference
☆10,882Mar 21, 2026Updated 4 months ago
databricks / lilac
View on GitHub
Curate better data for LLMs
☆1,072Mar 19, 2024Updated 2 years ago
the-crypt-keeper / can-ai-code
View on GitHub
Self-evaluating interview for AI coders
☆598Jun 21, 2025Updated last year
turboderp-org / exui
View on GitHub
Web UI for ExLlamaV2
☆513Feb 5, 2025Updated last year
abacaj / fine-tune-mistral
View on GitHub
Fine-tune mistral-7B on 3090s, a100s, h100s
☆735Oct 11, 2023Updated 2 years ago
guidance-ai / guidance
View on GitHub
A guidance language for controlling large language models.
☆21,694May 21, 2026Updated 2 months ago
ShishirPatil / gorilla
View on GitHub
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
☆12,962Apr 13, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
e-p-armstrong / augmentoolkit
View on GitHub
Create Custom LLMs
☆1,859Jun 27, 2026Updated 3 weeks ago
the-crypt-keeper / the-muse
View on GitHub
Experimental sampler to make LLMs more creative
☆31Aug 2, 2023Updated 2 years ago
epfml / landmark-attention
View on GitHub
Landmark Attention: Random-Access Infinite Context Length for Transformers
☆426Dec 20, 2023Updated 2 years ago
jzhang38 / TinyLlama
View on GitHub
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆9,017May 3, 2024Updated 2 years ago
gururise / AlpacaDataCleaned
View on GitHub
Alpaca dataset from Stanford, cleaned and curated
☆1,602Mar 7, 2026Updated 4 months ago
taylorai / galactic
View on GitHub
data cleaning and curation for unstructured text
☆329Aug 6, 2024Updated last year
imoneoi / multipack
View on GitHub
Multipack distributed sampler for fast padding-free training of LLMs
☆207Aug 10, 2024Updated last year