DeepMathLLM / DeepMathLinks
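An open-source mathematical large language model project that aims to explore whether large models possess mathematical creativity, and what potential they have in frontier mathematical research.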
☆16 Updated 4 months ago
Alternatives and similar repositories for DeepMath
Users that are interested in DeepMath are comparing it to the libraries listed below
- Bayes-Adaptive RL for LLM Reasoning ☆39 Updated 4 months ago
- Resa: Transparent Reasoning Models via SAEs ☆41 Updated this week
- ☆42 Updated last year
- ☆67 Updated 2 months ago
- ☆40 Updated 3 months ago
- ☆27 Updated 3 months ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs ☆16 Updated 2 weeks ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents ☆31 Updated this week
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆22 Updated last month
- The original Shared Recurrent Memory Transformer implementation ☆31 Updated 2 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆20 Updated 6 months ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508) ☆37 Updated last week
- ☆42 Updated last month
- ☆34 Updated last month
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind" ☆19 Updated 3 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion ☆13 Updated 6 months ago
- Geometric-Mean Policy Optimization ☆80 Updated last month
- LIMI: Less is More for Agency ☆69 Updated this week
- Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environme… ☆19 Updated 10 months ago
- 😊 TPTT: Transforming Pretrained Transformers into Titans ☆27 Updated last week
- ☆23 Updated last week
- Large language models designed for formal theorem proving through tool-integrated reasoning. ☆28 Updated last month
- ☆22 Updated last year
- [NeurIPS 2025 Oral] Exploring Diffusion Transformer Designs via Grafting ☆52 Updated 3 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning ☆117 Updated last week
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25] ☆32 Updated 3 weeks ago
- [ACL 2025] Agentic Knowledgeable Self-awareness ☆83 Updated 3 months ago
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study ☆54 Updated 10 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems" ☆12 Updated 9 months ago
- ☆19 Updated 6 months ago