DeepMathLLM / DeepMathLinks
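An open-source mathematical large language model project that explores whether large models are capable of mathematical creativity, and what potential they have in frontier mathematical research.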
☆14 Updated 3 weeks ago
Alternatives and similar repositories for DeepMath
Users who are interested in DeepMath are comparing it to the repositories listed below.
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories ☆15 Updated 3 weeks ago
- ☆19 Updated this week
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆18 Updated 3 months ago
- ☆43 Updated 8 months ago
- ☆32 Updated 5 months ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 Updated 7 months ago
- Lottery Ticket Adaptation ☆39 Updated 6 months ago
- Bayes-Adaptive RL for LLM Reasoning ☆23 Updated last week
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508) ☆27 Updated last week
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆27 Updated last year
- NeurIPS 2024 tutorial on LLM Inference ☆45 Updated 5 months ago
- Reinforcing General Reasoning without Verifiers ☆51 Updated last week
- ☆45 Updated 3 months ago
- A specialized RWKV-7 model for Othello (a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. It… ☆41 Updated 4 months ago
- Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025 ☆23 Updated last month
- ☆16 Updated 3 months ago
- ☆21 Updated 8 months ago
- The original Shared Recurrent Memory Transformer implementation ☆27 Updated this week
- ☆18 Updated last month
- ☆19 Updated 2 months ago
- Official Implementation of APB (ACL 2025 main) ☆28 Updated 3 months ago
- ☆22 Updated 11 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… ☆21 Updated 5 months ago
- ☆51 Updated this week
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆10 Updated 2 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion ☆13 Updated 2 months ago
- ☆40 Updated 3 weeks ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k ☆14 Updated 3 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆23 Updated last week
- MPI Code Generation through Domain-Specific Language Models ☆14 Updated 6 months ago