tongjingqi / MathTrap
☆58 Updated 2 months ago
Alternatives and similar repositories for MathTrap:
Users who are interested in MathTrap are comparing it to the repositories listed below.
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨ ☆173 Updated 9 months ago
- The official repository of the Omni-MATH benchmark. ☆71 Updated last month
- [ACL 2024] Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie… ☆126 Updated 7 months ago
- ☆320 Updated 2 weeks ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit… ☆114 Updated 7 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆94 Updated 2 months ago
- The related works and background techniques about Openai o1 ☆210 Updated last month
- The official code repository for PRMBench. ☆64 Updated this week
- FeatureAlignment = Alignment + Mechanistic Interpretability ☆28 Updated last month
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆52 Updated 2 months ago
- ☆24 Updated last year
- ☆130 Updated 2 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning ☆42 Updated 3 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO. ☆289 Updated 6 months ago
- The awesome agents in the era of large language models ☆59 Updated last year
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee… ☆38 Updated 6 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. ☆152 Updated this week
- ☆45 Updated 4 months ago
- SOTA Math Opensource LLM ☆331 Updated last year
- ☆82 Updated last month
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆104 Updated 5 months ago
- ☆26 Updated 9 months ago
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc. ☆201 Updated 4 months ago
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset ☆95 Updated 7 months ago
- Repo of paper "Free Process Rewards without Process Labels" ☆123 Updated last month