This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
☆95Nov 13, 2025Updated 7 months ago
Alternatives and similar repositories for MCTS-GSM8k-Demo
Users that are interested in MCTS-GSM8k-Demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆340Jun 5, 2025Updated last year
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆19Apr 1, 2025Updated last year
- Code for EACL 26 Findings paper "I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search"☆13Jan 28, 2026Updated 4 months ago
- ☆36Jun 5, 2025Updated last year
- MLLM @ Game☆17May 12, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is a repository of Binary General Matrix Multiply (BGEMM) by customized CUDA kernel. Thank FP6-LLM for the wheels!☆20Aug 30, 2024Updated last year
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- Simultaneous evaluation on both functionality and security of LLM-generated code.☆39Mar 6, 2026Updated 3 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆706Jan 20, 2025Updated last year
- ☆49May 9, 2026Updated last month
- alternative way to calculating self attention☆18May 25, 2024Updated 2 years ago
- Improving transparency of large language models' reasoning☆15Nov 25, 2025Updated 6 months ago
- ☆1,342Nov 21, 2024Updated last year
- INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness☆15Jun 2, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated last year
- ☆50Oct 28, 2024Updated last year
- Official repository for Activation-Informed Merging (AIM) of Large Language Models☆23Feb 10, 2025Updated last year
- An Open Large Reasoning Model for Real-World Solutions☆1,540Updated this week
- Official Pytorch Implementation of Our Paper Accepted at ICLR 2024-- Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLM…☆51Apr 9, 2024Updated 2 years ago
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- The official repository of ICCV 2025 paper "CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning".☆20Nov 26, 2025Updated 6 months ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆60Jul 23, 2024Updated last year
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆303Nov 16, 2024Updated last year
- ☆13Jul 2, 2025Updated 11 months ago
- Data and code for "Chain-of-Thought in Neural Code Generation: From and For Lightweight Language Models", which accepted in TSE.☆15Jul 3, 2024Updated last year
- This repo is the artifact of FUEL☆16May 19, 2026Updated last month
- A Simple yet Effective Relation Information Guided Approach for Few-Shot Relation Extraction☆19May 30, 2022Updated 4 years ago
- ☆54Jul 16, 2024Updated last year
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆32Aug 18, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆31Sep 12, 2025Updated 9 months ago
- ☆12Feb 28, 2025Updated last year
- ☆28Sep 15, 2025Updated 9 months ago
- [ICCAD 2024] SNNGX: Securing Spiking Neural Networks with Genetic XOR Encryption on RRAM-based Neuromorphic Accelerator☆11Feb 3, 2026Updated 4 months ago
- Practical Claude Code skills — English-for-engineers coaching, Pi Agent setup, and more. Install via npx skills add.☆49Updated this week
- ☆28Aug 1, 2024Updated last year
- ☆47Jun 24, 2025Updated 11 months ago