tongjingqi / Code2LogicLinks

Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning

☆74

Alternatives and similar repositories for Code2Logic

Users that are interested in Code2Logic are comparing it to the libraries listed below

Sorting:

ssmisya / PRMBench
[ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.
☆81Updated 7 months ago
hkust-nlp / Laser
Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
☆55Updated 3 months ago
OpenBMB / RLPR
Extrapolating RLVR to General Domains without Verifiers
☆160Updated last month
IAAR-Shanghai / xVerify
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
☆128Updated 5 months ago
PRIME-RL / Entropy-Mechanism-of-RL
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
☆326Updated 2 months ago
RyanLiu112 / Awesome-Process-Reward-Models
A comprehensive collection of process reward models.
☆108Updated 2 months ago
WooooDyy / MathCritique
Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".
☆56Updated 9 months ago
GAIR-NLP / LIMR
☆209Updated 7 months ago
InfiMM / Awesome-Multimodal-LLM-for-Math-STEM
Paper collections of multi-modal LLM for Math/STEM/Code.
☆127Updated last month
RyanLiu112 / GenPRM
Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".
☆81Updated 3 months ago
XiaoYee / Awesome_Efficient_LRM_Reasoning
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
☆294Updated last week
RM-R1-UIUC / RM-R1
RM-R1: Unleashing the Reasoning Potential of Reward Models
☆134Updated 2 months ago
yafuly / TPO
Test-time preferenece optimization (ICML 2025).
☆165Updated 4 months ago
ShadeCloak / ADORA
☆46Updated 5 months ago
ElliottYan / LUFFY
Official Repository of "Learning to Reason under Off-Policy Guidance"
☆301Updated 2 weeks ago
ltzheng / SimpleTIR
End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆273Updated last week
Blueyee / Efficient-CoT-LRMs
Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!
☆69Updated 5 months ago
InternLM / POLAR
Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.
☆152Updated last week
OpenMOSS / Thus-Spake-Long-Context-LLM
a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation
☆56Updated 5 months ago
THU-KEG / AdaptThink
☆152Updated 3 months ago
xuyige / SoftCoT
ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…
☆46Updated 3 months ago
Hongcheng-Gao / Awesome-Long2short-on-LRMs
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…
☆245Updated last month
iie-ycx / DEER
This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.
☆171Updated 2 months ago
njucckevin / MM-Self-Improve
A Self-Training Framework for Vision-Language Reasoning
☆84Updated 7 months ago
hkust-nlp / mstar
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆69Updated 2 months ago
ChartMimic / ChartMimic
[ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation
☆123Updated 3 months ago
QingyangZhang / Label-Free-RLVR
☆267Updated 2 months ago
hahahawu / Long-to-Short-via-Model-Merging
Model merging is a highly efficient approach for long-to-short reasoning.
☆82Updated 3 months ago
KbsdJames / Omni-MATH
The official repository of the Omni-MATH benchmark.
☆87Updated 8 months ago
xiaomi-research / colar
[arXiv2505] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains
☆50Updated last month