Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
โ459Apr 18, 2024Updated last year
Alternatives and similar repositories for rho
Users that are interested in rho are comparing it to the libraries listed below
Sorting:
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. ๐งฎโจโ273Apr 26, 2024Updated last year
- [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (As Huggingface Daily Papers: โฆโ90Nov 23, 2025Updated 3 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Modelsโ201Dec 8, 2025Updated 2 months ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]โ588Dec 9, 2024Updated last year
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]โ147Sep 20, 2024Updated last year
- โ321Sep 18, 2024Updated last year
- โ565Nov 20, 2024Updated last year
- โ64Apr 9, 2024Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"โ48Jan 17, 2024Updated 2 years ago
- โ30Dec 27, 2024Updated last year
- The code and data for the paper JiuZhang3.0โ49May 26, 2024Updated last year
- โ109Jul 15, 2025Updated 7 months ago
- Code for Quiet-STaRโ741Aug 21, 2024Updated last year
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"โ446Oct 16, 2024Updated last year
- Official Repo for Open-Reasoner-Zeroโ2,087Jun 2, 2025Updated 8 months ago
- Simple RL training for reasoningโ3,830Dec 23, 2025Updated 2 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuningโ512Oct 20, 2024Updated last year
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]โ79Nov 14, 2024Updated last year
- [ACL 2024] Progressive LLaMA with Block Expansion.โ514May 20, 2024Updated last year
- Data and tools for generating and inspecting OLMo pre-training data.โ1,411Nov 5, 2025Updated 3 months ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factualityโ231Aug 2, 2024Updated last year
- [NeurIPS'24] Official code for *๐ฏDART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*โ120Dec 10, 2024Updated last year
- Scaling Data-Constrained Language Modelsโ342Jun 28, 2025Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]โ149Oct 27, 2024Updated last year
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting witโฆโ1,113Feb 22, 2024Updated 2 years ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.โ753Sep 27, 2024Updated last year
- AllenAI's post-training codebaseโ3,592Updated this week
- AnchorAttention: Improved attention for LLMs long-context trainingโ214Jan 15, 2025Updated last year
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"โ391Jan 19, 2025Updated last year
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.โ2,903Updated this week
- FuseAI Projectโ590Jan 25, 2025Updated last year
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other moโฆโ416Jun 25, 2025Updated 8 months ago
- An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"โ36Jun 7, 2024Updated last year
- A Survey on Data Selection for Language Modelsโ253Apr 29, 2025Updated 10 months ago
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruningโ641Mar 4, 2024Updated last year
- โ1,104Jan 10, 2026Updated last month
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scaleโ266Jul 8, 2025Updated 7 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuningโ189Jun 25, 2025Updated 8 months ago
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" [ICLR 2024]โ383Aug 25, 2024Updated last year