MoonshotAI / Kimina-Prover-Preview
Technical report of Kimina-Prover Preview.
☆231Updated last week
Alternatives and similar repositories for Kimina-Prover-Preview:
Users that are interested in Kimina-Prover-Preview are comparing it to the libraries listed below
- ☆175Updated 3 weeks ago
- The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"☆73Updated 3 weeks ago
- ☆50Updated 3 weeks ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆229Updated last week
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆101Updated 4 months ago
- Retrieval-Augmented Theorem Provers for Lean☆267Updated 2 months ago
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆141Updated 9 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆152Updated last week
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆95Updated last month
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆331Updated this week
- The official repository of the Omni-MATH benchmark.☆80Updated 4 months ago
- The official repository for the paper Multilingual Mathematical Autoformalization☆35Updated 11 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆182Updated last week
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆191Updated last month
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆173Updated last month
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆122Updated 9 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 3 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆882Updated last week
- ☆283Updated last month
- An Open Math Pre-trainng Dataset with 370B Tokens.☆72Updated 3 weeks ago
- Async pipelined version of Verl☆60Updated 2 weeks ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆139Updated this week
- AI for Mathematics (AI4Math) paper list☆158Updated 6 months ago
- ☆519Updated last week
- ☆42Updated 7 months ago
- ☆149Updated 4 months ago
- MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems☆86Updated 9 months ago
- ☆187Updated 2 months ago
- Efficient triton implementation of Native Sparse Attention.☆139Updated 2 weeks ago
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries☆59Updated last year