CodeCreator / WebOrganizer
Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
☆42Updated last month
Alternatives and similar repositories for WebOrganizer:
Users that are interested in WebOrganizer are comparing it to the libraries listed below
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]☆63Updated 5 months ago
- Exploration of automated dataset selection approaches at large scales.☆37Updated last month
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆72Updated 4 months ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 6 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆164Updated 9 months ago
- Long Context Extension and Generalization in LLMs☆53Updated 6 months ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆174Updated last month
- ☆56Updated last month
- ☆70Updated 5 months ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆30Updated 10 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆109Updated 9 months ago
- ☆98Updated 6 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆61Updated 5 months ago
- ☆126Updated 5 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆45Updated 2 months ago
- LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models☆74Updated 6 months ago
- ☆45Updated last month
- The HELMET Benchmark☆127Updated this week
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆34Updated 4 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆74Updated 10 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]☆105Updated last month
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆90Updated this week
- ☆48Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆80Updated 8 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆46Updated last month
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"☆34Updated 7 months ago
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆57Updated 4 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆137Updated 5 months ago
- ☆41Updated 8 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆47Updated 3 months ago