imagination-research / aimo2
AIMO2 2nd place solution
☆46 Updated last week
Alternatives and similar repositories for aimo2:
Users who are interested in aimo2 are comparing it to the repositories listed below:
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection ☆42 Updated 5 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs ☆128 Updated last month
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)" ☆175 Updated last month
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆203 Updated 11 months ago
- Reproducing R1 for Code with Reliable Rewards ☆173 Updated 2 weeks ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs. ☆61 Updated 5 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆101 Updated 4 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025] ☆105 Updated 2 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ☆190 Updated last month
- ☆51 Updated last week
- ☆57 Updated last month
- A Comprehensive Survey on Long Context Language Modeling ☆131 Updated 3 weeks ago
- ☆149 Updated 4 months ago
- ☆185 Updated 2 months ago
- Async pipelined version of Verl ☆60 Updated 2 weeks ago
- ☆62 Updated 5 months ago
- ☆125 Updated 3 weeks ago
- ☆63 Updated 4 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" ☆137 Updated 2 months ago
- Repository of LV-Eval Benchmark ☆63 Updated 7 months ago
- Simple extension on vLLM to help you speed up reasoning model without training. ☆146 Updated this week
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings ☆154 Updated 10 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit… ☆122 Updated 9 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆147 Updated 7 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆115 Updated last month
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning". ☆92 Updated last month
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main) ☆102 Updated last month
- ☆98 Updated 6 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning ☆171 Updated last month
- Long Context Extension and Generalization in LLMs ☆53 Updated 7 months ago