This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
☆95Nov 13, 2025Updated 3 months ago
Alternatives and similar repositories for MCTS-GSM8k-Demo
Users that are interested in MCTS-GSM8k-Demo are comparing it to the libraries listed below
Sorting:
- About The corresponding code from our paper " Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…☆13Jan 14, 2026Updated last month
- ☆342Jun 5, 2025Updated 9 months ago
- 利用大语言模型进行卧底游戏,包括谁是卧底及衍生的发现AI卧底游戏等。☆11Sep 6, 2024Updated last year
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆51Nov 9, 2024Updated last year
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated 2 years ago
- MLX binary vectors and associated algorithms.☆14Mar 13, 2025Updated 11 months ago
- Linux操作系统学习笔记☆20Jan 11, 2024Updated 2 years ago
- [EMNLP 2023 (Findings)] Schema-adaptable Knowledge Graph Construction☆22Jan 28, 2024Updated 2 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Aug 30, 2024Updated last year
- MLLM @ Game☆16May 12, 2025Updated 9 months ago
- ☆32Jun 5, 2025Updated 9 months ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆18Apr 1, 2025Updated 11 months ago
- AgentHub is the only SDK you need to connect to state-of-the-art LLMs (GPT-5.2/Claude 4.6/Gemini 3.1).☆54Updated this week
- GPTQ inference TVM kernel☆40Apr 25, 2024Updated last year
- ☆19Nov 13, 2023Updated 2 years ago
- A Python reimplementation/extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆18Dec 1, 2023Updated 2 years ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated 10 months ago
- A Simple yet Effective Relation Information Guided Approach for Few-Shot Relation Extraction☆19May 30, 2022Updated 3 years ago
- ☆19Jun 13, 2024Updated last year
- ☆20Sep 28, 2024Updated last year
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆53Dec 13, 2025Updated 2 months ago
- Code base of In-Context Learning for Dialogue State tracking☆45Sep 24, 2023Updated 2 years ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆124Sep 9, 2024Updated last year
- An Open Large Reasoning Model for Real-World Solutions☆1,537Feb 13, 2026Updated 3 weeks ago
- ☆31Sep 12, 2025Updated 5 months ago
- NexAU (AU for Agent Universe), a general-purpose agent framework for building intelligent agents with tool capabilities.☆49Mar 2, 2026Updated last week
- Repository of paper "How Likely Do LLMs with CoT Mimic Human Reasoning?"☆23Feb 19, 2025Updated last year
- Official code for the paper "ADaPT: As-Needed Decomposition and Planning with Language Models"☆90Jan 3, 2024Updated 2 years ago
- ☆1,345Nov 21, 2024Updated last year
- Deertick Agent Management and Integration Toolbox (DAMIT)☆22Dec 31, 2025Updated 2 months ago
- ☆27Aug 1, 2024Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆139Jun 12, 2024Updated last year
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆692Jan 20, 2025Updated last year
- ☆51Oct 28, 2024Updated last year
- Simple GRPO scripts and configurations.☆59Feb 6, 2025Updated last year
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Jan 8, 2026Updated 2 months ago
- ☆46Jun 24, 2025Updated 8 months ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆31Aug 18, 2024Updated last year
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆298Nov 16, 2024Updated last year