[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆266Jul 8, 2025Updated 7 months ago
Alternatives and similar repositories for ProX
Users that are interested in ProX are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆107Mar 6, 2025Updated 11 months ago
- BeHonest: Benchmarking Honesty in Large Language Models☆34Aug 15, 2024Updated last year
- Trending projects & awesome papers about data-centric llm studies.☆40May 20, 2025Updated 9 months ago
- ☆13Jul 14, 2024Updated last year
- Collections of RLxLM experiments using minimal codes☆14Feb 17, 2025Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆588Dec 9, 2024Updated last year
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"☆42Jul 19, 2024Updated last year
- Safety-J: Evaluating Safety with Critique☆16Jul 28, 2024Updated last year
- ☆58Sep 2, 2024Updated last year
- Evaluate the Quality of Critique☆36Jun 1, 2024Updated last year
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆201Dec 8, 2025Updated 2 months ago
- O1 Replication Journey☆1,999Jan 14, 2025Updated last year