hkust-nlp / deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

☆588

Alternatives and similar repositories for deita

Users who are interested in deita are comparing it to the repositories listed below.


tianyi-lab / Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆416 · Jun 25, 2025 · Updated 7 months ago
tianyi-lab / Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆188 · Jun 25, 2025 · Updated 7 months ago
CASIA-LM / MoDS
☆147 · Apr 16, 2024 · Updated last year
princeton-nlp / LESS
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
☆511 · Oct 20, 2024 · Updated last year
OFA-Sys / InsTag
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
☆285 · Aug 20, 2023 · Updated 2 years ago
pldlgb / nuggets
☆87 · Dec 29, 2023 · Updated 2 years ago
magpie-align / magpie
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …
☆826 · Mar 17, 2025 · Updated 10 months ago
hkust-nlp / dart-math
[NeurIPS'24] Official code for 🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving
☆120 · Dec 10, 2024 · Updated last year
pjlab-sys4nlp / llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
☆1,003 · Dec 6, 2024 · Updated last year
QwenLM / AutoIF
☆322 · Jul 25, 2024 · Updated last year
princeton-nlp / QuRating
[ICML 2024] Selecting High-Quality Data for Training Language Models
☆201 · Dec 8, 2025 · Updated 2 months ago
GAIR-NLP / ReAlign
Reformatted Alignment
☆111 · Sep 23, 2024 · Updated last year
hkust-nlp / mstar
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆70 · Jul 13, 2025 · Updated 7 months ago
IronBeliever / CaR
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
☆90 · Nov 13, 2024 · Updated last year
GAIR-NLP / ProX
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆266 · Jul 8, 2025 · Updated 7 months ago
OpenRLHF / OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
☆8,989 · Feb 6, 2026 · Updated last week
TencentARC / LLaMA-Pro
[ACL 2024] Progressive LLaMA with Block Expansion.
☆514 · May 20, 2024 · Updated last year
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆459 · Apr 18, 2024 · Updated last year
allenai / open-instruct
AllenAI's post-training codebase
☆3,573 · Updated this week
jzhang38 / EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆752 · Sep 27, 2024 · Updated last year
meowpass / FollowComplexInstruction
Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…
☆53 · Jun 24, 2024 · Updated last year
hkust-nlp / llm-compression-intelligence
Official GitHub repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]
☆147 · Sep 20, 2024 · Updated last year
tianyi-lab / Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
☆366 · Sep 6, 2024 · Updated last year
HKUNLP / ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆445 · Oct 16, 2024 · Updated last year
huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
☆5,495 · Sep 8, 2025 · Updated 5 months ago
Blue-Raincoat / SelectIT
☆24 · Oct 14, 2024 · Updated last year
IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆169 · Sep 18, 2025 · Updated 4 months ago
GAIR-NLP / O1-Journey
O1 Replication Journey
☆1,999 · Jan 14, 2025 · Updated last year
neelsjain / NEFTune
Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning
☆409 · May 17, 2024 · Updated last year
ContextualAI / HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
☆905 · Sep 30, 2025 · Updated 4 months ago
hkust-nlp / simpleRL-reason
Simple RL training for reasoning
☆3,827 · Dec 23, 2025 · Updated last month
xfactlab / orpo
Official repository for ORPO
☆471 · May 31, 2024 · Updated last year
open-compass / opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, …
☆6,663 · Updated this week
Xwin-LM / Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
☆1,036 · May 31, 2024 · Updated last year
allenai / reward-bench
RewardBench: the first evaluation tool for reward models.
☆687 · Jan 31, 2026 · Updated 2 weeks ago
chujiezheng / LLM-Extrapolation
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆75 · May 20, 2025 · Updated 8 months ago
locuslab / scaling_laws_data_filtering
☆64 · Apr 9, 2024 · Updated last year
RLHFlow / RLHF-Reward-Modeling
Recipes to train reward models for RLHF.
☆1,512 · Apr 24, 2025 · Updated 9 months ago
feiyang-k / AutoScale
Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…
☆13 · Aug 8, 2025 · Updated 6 months ago
