☆85Jul 10, 2024Updated last year
Alternatives and similar repositories for CMU_MATH-AIMO
Users who are interested in CMU_MATH-AIMO are comparing it to the libraries listed below.
- Simple and efficient pytorch-native transformer training and inference (batched)☆78Apr 2, 2024Updated 2 years ago
- ☆494Jul 22, 2024Updated last year
- ☆14Mar 11, 2024Updated 2 years ago
- ☆25Jun 2, 2026Updated 3 weeks ago
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆50Aug 7, 2025Updated 10 months ago
- Get start with RL, today☆15Sep 9, 2025Updated 9 months ago
- ☆14Jul 17, 2025Updated 11 months ago
- The official repository for the paper Multilingual Mathematical Autoformalization☆38May 20, 2024Updated 2 years ago
- ☆52Mar 5, 2025Updated last year
- Uncertainty quantification for in-context learning of large language models☆15Apr 1, 2024Updated 2 years ago
- ☆56Nov 22, 2024Updated last year
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆35Aug 6, 2023Updated 2 years ago
- This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…☆32Dec 5, 2024Updated last year
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…☆1,118Feb 22, 2024Updated 2 years ago
- [ICLR'26] "Nabla-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space" by Peihao Wang*, Ruisi Cai*, Zhen Wang, Hongyuan…☆35Mar 10, 2026Updated 3 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Sep 9, 2024Updated last year
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries☆66Feb 29, 2024Updated 2 years ago
- [ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (https://huggingface.co/papers…☆91Nov 23, 2025Updated 7 months ago
- Code of ICML paper arxiv.org/abs/2302.08105☆14May 4, 2023Updated 3 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆16Jun 28, 2024Updated 2 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated 2 years ago
- ☆13Aug 9, 2022Updated 3 years ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆277Apr 26, 2024Updated 2 years ago
- ☆24Aug 27, 2025Updated 10 months ago
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆31Jan 10, 2026Updated 5 months ago
- Companion code for a tutorial on using Hydra.☆33May 24, 2021Updated 5 years ago
- ☆1,034Dec 17, 2024Updated last year
- ☆22Jan 14, 2026Updated 5 months ago
- 👌[ICLR 2025] TFG-Flow: Training-free Guidance in Multimodal Generative Flow☆20Mar 4, 2025Updated last year
- The Mixing method: coordinate descent for low-rank semidefinite programming☆15Apr 30, 2021Updated 5 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- chinese pretrain unilm☆28Apr 14, 2020Updated 6 years ago
- ☆26Aug 23, 2024Updated last year
- ☆14Oct 21, 2024Updated last year
- DeepAlgebra☆26Oct 26, 2017Updated 8 years ago
- ☆351May 24, 2025Updated last year
- Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on math…☆73Jul 27, 2024Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated 2 years ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆63Nov 27, 2024Updated last year