This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
☆94Nov 13, 2025Updated 6 months ago
Alternatives and similar repositories for MCTS-GSM8k-Demo
Users that are interested in MCTS-GSM8k-Demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆340Jun 5, 2025Updated 11 months ago
- Code for EACL 26 Findings paper "I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search"☆13Jan 28, 2026Updated 4 months ago
- ☆35Jun 5, 2025Updated 11 months ago
- MLLM @ Game☆16May 12, 2025Updated last year
- This is a repository of Binary General Matrix Multiply (BGEMM) by customized CUDA kernel. Thank FP6-LLM for the wheels!☆20Aug 30, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- Simultaneous evaluation on both functionality and security of LLM-generated code.☆38Mar 6, 2026Updated 2 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆63Aug 30, 2024Updated last year
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆705Jan 20, 2025Updated last year
- ☆48May 9, 2026Updated 3 weeks ago
- AgentHub is the LLM API Hub for the Agent era, built for high-precision autonomous agents. (GPT-5.5/Claude 4.6/Gemini 3.1)☆91Updated this week
- alternative way to calculating self attention☆18May 25, 2024Updated 2 years ago
- About The corresponding code from our paper " Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…☆13Jan 14, 2026Updated 4 months ago
- Improving transparency of large language models' reasoning☆15Nov 25, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆1,342Nov 21, 2024Updated last year
- INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness☆15Nov 10, 2025Updated 6 months ago
- 汇编语言学习的例子☆10Aug 5, 2021Updated 4 years ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated last year
- ☆50Oct 28, 2024Updated last year
- Official repository for Activation-Informed Merging (AIM) of Large Language Models☆23Feb 10, 2025Updated last year
- An Open Large Reasoning Model for Real-World Solutions☆1,536Feb 13, 2026Updated 3 months ago
- Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…☆51Apr 9, 2024Updated 2 years ago
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models☆11Jan 23, 2024Updated 2 years ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆397Jan 19, 2025Updated last year
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆91Jan 3, 2024Updated 2 years ago
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆60Jul 23, 2024Updated last year
- ☆13Jul 2, 2025Updated 10 months ago
- This repo is the artifact of FUEL☆16May 19, 2026Updated last week
- ☆54Jul 16, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆31Sep 12, 2025Updated 8 months ago
- ☆12Feb 28, 2025Updated last year
- ☆15Mar 7, 2025Updated last year
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆136Mar 21, 2025Updated last year
- ☆48May 17, 2026Updated last week
- ☆28Aug 1, 2024Updated last year
- ☆16Jul 31, 2025Updated 9 months ago