WooooDyy/MathCritique

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WooooDyy/MathCritique)

WooooDyy / MathCritique

Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".

☆55

Alternatives and similar repositories for MathCritique

Users that are interested in MathCritique are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mathllm / Step-Controlled_DPO
View on GitHub
☆23Jul 5, 2024Updated 2 years ago
CreaLabs / Enhanced-BGE-M3-with-CLP-and-MoE
View on GitHub
This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…
☆11Dec 27, 2024Updated last year
THUDM / ReST-MCTS
View on GitHub
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
☆709Jan 20, 2025Updated last year
QwenLM / ProcessBench
View on GitHub
Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"
☆189May 20, 2025Updated last year
cs-holder / Reasoning-Self-Evolution-Survey
View on GitHub
☆54Mar 6, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
JIA-Lab-research / Step-DPO
View on GitHub
Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"
☆398Jan 19, 2025Updated last year
yanxue7 / E3T-Overcooked
View on GitHub
☆15May 4, 2024Updated 2 years ago
ljang0 / videowebarena
View on GitHub
☆14Dec 25, 2024Updated last year
genrm-star / genrm-critiques
View on GitHub
GenRM-CoT: Data release for verification rationales
☆68Oct 16, 2024Updated last year
SimpleBerry / LLaMA-O1
View on GitHub
Large Reasoning Models
☆803Dec 3, 2024Updated last year
open-compass / GPassK
View on GitHub
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆33Aug 5, 2025Updated 11 months ago
debjitpaul / Causal_CoT
View on GitHub
About The corresponding code from our paper " Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…
☆13Jan 14, 2026Updated 6 months ago
OpenBMB / Eurus
View on GitHub
☆322Sep 18, 2024Updated last year
Kun-Xiang / AtomThink
View on GitHub
[TPAMI 2026] Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"
☆66Nov 18, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zitian-gao / SC-MCTS
View on GitHub
Interpretable Contrastive Monte Carlo Tree Search Reasoning
☆52Nov 9, 2024Updated last year
yale-nlp / QTSumm
View on GitHub
Data and Code for EMNLP 2023 paper "QTSumm: Query-Focused Summarization over Tabular Data"
☆23Mar 29, 2024Updated 2 years ago
StarDewXXX / O1-Pruner
View on GitHub
Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning
☆99Feb 21, 2025Updated last year
HITsz-TMG / ICL-State-Vector
View on GitHub
☆12Jul 4, 2024Updated 2 years ago
hkgc-1 / GHPO
View on GitHub
☆62Jul 21, 2025Updated last year
cyzus / thoughtsculpt
View on GitHub
THOUGHTSCULPT, a general reasoning and search method for complex tasks
☆13Dec 13, 2024Updated last year
tengxiaoliu / LM_skip
View on GitHub
[NeurIPS 2024] Can Language Models Learn to Skip Steps?
☆21Jan 25, 2025Updated last year
rohinmanvi / Capability-Aware-and-Mid-Generation-Self-Evaluations
View on GitHub
☆21Jul 25, 2025Updated 11 months ago
marketdesignresearch / NOMU
View on GitHub
NOMU: Neural Optimization-based Model Uncertainty
☆10Feb 17, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
LaVi-Lab / LongContextReasoner
View on GitHub
[ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners
☆20May 28, 2024Updated 2 years ago
euxcet / thulearn2018
View on GitHub
Tools for Web Learning of Tsinghua University.
☆10Sep 17, 2024Updated last year
ZhangShiyue / extractive_is_not_faithful
View on GitHub
☆17May 19, 2023Updated 3 years ago
Exgc / R1V-Free
View on GitHub
R1V, trained with AI feedback, answers open-ended visual questions.
☆14Apr 12, 2025Updated last year
humblecoder612 / SAR_yolov3
View on GitHub
Best Accruacy:speed ratio SAR Ship detection in the world.
☆19May 3, 2020Updated 6 years ago
DIRECT-BIT / SRA-MCTS
View on GitHub
☆36Jun 5, 2025Updated last year
EsYoon7 / RLHF-TLCR
View on GitHub
[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
☆12Dec 6, 2024Updated last year
baopj / Vid-Morp
View on GitHub
☆12Dec 6, 2024Updated last year
vsubramaniam851 / multiagent-ft
View on GitHub
☆234Feb 24, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
baopj / E3M
View on GitHub
[ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.
☆11Jul 16, 2024Updated 2 years ago
LightChen233 / reasoning-boundary
View on GitHub
☆71Jun 18, 2025Updated last year
tuslkkk / tadpak
View on GitHub
Towards a Rigorous Evaluation of Time-series Anomaly Detection (AAAI'22)
☆32Feb 8, 2022Updated 4 years ago
uranus4ever / Vehicle-Detection
View on GitHub
Vehicle detection based on YOLO and SVM
☆15Jan 29, 2018Updated 8 years ago
THUDM / T1
View on GitHub
RL Scaling and Test-Time Scaling (ICML'25)
☆116Jan 23, 2025Updated last year
0xWJ / code-judge
View on GitHub
☆24Oct 10, 2025Updated 9 months ago
WooooDyy / BMMR
View on GitHub
Code and resources for the NeurIPS 2025 Paper "BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset" by Zhiheng X…
☆18Oct 14, 2025Updated 9 months ago