tongjingqi / MathTrap
☆58Updated last week
Alternatives and similar repositories for MathTrap:
Users that are interested in MathTrap are comparing it to the libraries listed below
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆65Updated this week
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆51Updated 3 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆119Updated 8 months ago
- SOTA RL fine-tuning solution for advanced math reasoning of LLM☆91Updated this week
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆153Updated this week
- The official repository of the Omni-MATH benchmark.☆77Updated 3 months ago
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆43Updated last week
- ☆104Updated 6 months ago
- The official code repository for PRMBench.☆68Updated last month
- A research repo for experiments about Reinforcement Finetuning☆36Updated last week
- ☆54Updated 5 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆189Updated 11 months ago
- Awesome RL-based LLM Reasoning☆341Updated this week
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆49Updated 8 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆212Updated this week
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆97Updated 8 months ago
- ☆166Updated last month
- ☆88Updated this week
- The official repository of "Whoever Started the Interference Should End It: Guiding Data-Free Model Merging via Task Vectors""☆14Updated 2 weeks ago
- ☆117Updated last week
- ☆186Updated this week
- Paper list for Efficient Reasoning.☆311Updated this week
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆41Updated 2 weeks ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆165Updated this week
- A Comprehensive Survey on Long Context Language Modeling☆86Updated last week
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆61Updated last month
- The related works and background techniques about Openai o1☆217Updated 2 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆158Updated this week
- ☆71Updated this week
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆74Updated 2 weeks ago