OFA-Sys/InsTag

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OFA-Sys/InsTag)

OFA-Sys / InsTag

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

☆285

Alternatives and similar repositories for InsTag

Users that are interested in InsTag are comparing it to the libraries listed below

Sorting:

hkust-nlp / deita
View on GitHub
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆588Dec 9, 2024Updated last year
tianyi-lab / Cherry_LLM
View on GitHub
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆416Jun 25, 2025Updated 8 months ago
QwenLM / AutoIF
View on GitHub
☆324Jul 25, 2024Updated last year
magpie-align / magpie
View on GitHub
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …
☆829Mar 17, 2025Updated 11 months ago
YJiangcm / FollowBench
View on GitHub
[ACL 2024] FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
☆119Jun 12, 2025Updated 8 months ago
tianyi-lab / Superfiltering
View on GitHub
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆189Jun 25, 2025Updated 8 months ago
OFA-Sys / gsm8k-ScRel
View on GitHub
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
☆270Sep 12, 2024Updated last year
thu-coai / ComplexBench
View on GitHub
Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)
☆102Feb 20, 2025Updated last year
princeton-nlp / LESS
View on GitHub
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
☆512Oct 20, 2024Updated last year
OFA-Sys / Ditto
View on GitHub
A self-ailgnment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters…
☆211May 28, 2024Updated last year
princeton-nlp / QuRating
View on GitHub
[ICML 2024] Selecting High-Quality Data for Training Language Models
☆201Dec 8, 2025Updated 2 months ago
ZigeW / data_management_LLM
View on GitHub
Collection of training data management explorations for large language models
☆336Aug 2, 2024Updated last year
GanjinZero / RRHF
View on GitHub
[NIPS2023] RRHF & Wombat
☆809Sep 22, 2023Updated 2 years ago
RLHFlow / RLHF-Reward-Modeling
View on GitHub
Recipes to train reward model for RLHF.
☆1,515Apr 24, 2025Updated 10 months ago
open-compass / opencompass
View on GitHub
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …
☆6,688Updated this week
tianyi-lab / Reflection_Tuning
View on GitHub
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
☆367Sep 6, 2024Updated last year
opendatalab / WanJuan1.0
View on GitHub
万卷1.0多模态语料
☆569Oct 20, 2023Updated 2 years ago
meowpass / FollowComplexInstruction
View on GitHub
Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…
☆53Jun 24, 2024Updated last year
THUDM / ChatGLM-Math
View on GitHub
☆84Apr 18, 2024Updated last year
OpenBMB / UltraFeedback
View on GitHub
A large-scale, fine-grained, diverse preference dataset (and models).
☆363Dec 29, 2023Updated 2 years ago
alibaba / ChatLearn
View on GitHub
A flexible and efficient training framework for large-scale alignment tasks
☆450Oct 23, 2025Updated 4 months ago
THUDM / AlignBench
View on GitHub
大模型多维度中文对齐评测基准 (ACL 2024)
☆420Oct 25, 2025Updated 4 months ago
allenai / dolma
View on GitHub
Data and tools for generating and inspecting OLMo pre-training data.
☆1,411Nov 5, 2025Updated 3 months ago
allenai / reward-bench
View on GitHub
RewardBench: the first evaluation tool for reward models.
☆696Feb 16, 2026Updated last week
2003pro / TAGCOS
View on GitHub
This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
☆13Jul 21, 2024Updated last year
Open-Reasoner-Zero / Open-Reasoner-Zero
View on GitHub
Official Repo for Open-Reasoner-Zero
☆2,087Jun 2, 2025Updated 8 months ago
openai / prm800k
View on GitHub
800,000 step-level correctness labels on LLM solutions to MATH problems
☆2,091Jun 1, 2023Updated 2 years ago
huggingface / cosmopedia
View on GitHub
☆565Nov 20, 2024Updated last year
huggingface / Math-Verify
View on GitHub
☆1,104Jan 10, 2026Updated last month
AetherCortex / Llama-X
View on GitHub
Open Academic Research on Improving LLaMA to SOTA LLM
☆1,611Aug 30, 2023Updated 2 years ago
microsoft / rho
View on GitHub
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆459Apr 18, 2024Updated last year
OpenRLHF / OpenRLHF
View on GitHub
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
☆9,037Feb 21, 2026Updated last week
FreedomIntelligence / OVM
View on GitHub
☆72Apr 2, 2024Updated last year
allenai / open-instruct
View on GitHub
AllenAI's post-training codebase
☆3,592Updated this week
thu-coai / CritiqueLLM
View on GitHub
☆148Jul 1, 2024Updated last year
THUDM / LongAlign
View on GitHub
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
☆260Dec 16, 2024Updated last year
hkust-nlp / dart-math
View on GitHub
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
☆120Dec 10, 2024Updated last year
pldlgb / nuggets
View on GitHub
☆87Dec 29, 2023Updated 2 years ago
JIA-Lab-research / Step-DPO
View on GitHub
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
☆391Jan 19, 2025Updated last year