NumberChiffre/mcts-llm

☆98

Alternatives and similar repositories for mcts-llm

Users who are interested in mcts-llm are comparing it to the libraries listed below.

naivoder / MCTSr
Monte Carlo Tree Search Self-Refine (MCTSr)
☆21 · Jul 6, 2024 · Updated 2 years ago
SidU / MathBlackBox
☆11 · Jul 21, 2024 · Updated last year
BrendanGraham14 / mcts-llm
☆130 · Jun 18, 2024 · Updated 2 years ago
ack-sec / toyberry
Toy implementation of Strawberry
☆33 · Sep 24, 2024 · Updated last year
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆54 · Jun 6, 2025 · Updated last year
THUDM / ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
☆709 · Jan 20, 2025 · Updated last year
trotsky1997 / MathBlackBox
☆1,033 · Dec 17, 2024 · Updated last year
HappyGu0524 / Controllable-Text-Generation
☆16 · Oct 5, 2022 · Updated 3 years ago
lipiji / dialogue-hred-vhred
HRED VHRED VHCR for Multi-Turn Dialogue Systems
☆43 · Dec 16, 2019 · Updated 6 years ago
openreasoner / openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
☆1,848 · Jan 17, 2025 · Updated last year
ictnlp / RSI-NAT
Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation"
☆18 · Aug 31, 2019 · Updated 6 years ago
hkust-nlp / deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆599 · Dec 9, 2024 · Updated last year
zhentingqi / rStar
☆972 · Jan 23, 2025 · Updated last year
SIMONLQY / RethinkMCTS
☆34 · Oct 2, 2024 · Updated last year
wangle1218 / NLP-Interview-Notes
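Study notes and materials for natural language processing (NLP) interview preparation, compiled by the authors from their own interviews and experience. The material currently collects interview questions from across the various areas of NLP.
☆15 · Mar 9, 2021 · Updated 5 years ago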
zbambergerNLP / strategic-debate-tot
A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments
☆103 · Oct 3, 2025 · Updated 9 months ago
1989Ryan / llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…
☆303 · Nov 16, 2024 · Updated last year
chenllliang / G1
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
☆103 · May 20, 2025 · Updated last year
THU-BPM / Pinocchio
Dataset Pinocchio for paper "Towards Understanding Factual Knowledge of Large Language Models" accepted by ICLR 2024 (Spotlight)
☆12 · Mar 13, 2024 · Updated 2 years ago
chridey / altlex
☆11 · Apr 4, 2018 · Updated 8 years ago
zhliu0106 / probing-lm-data
Official Implementation of "Probing Language Models for Pre-training Data Detection"
☆20 · Dec 4, 2024 · Updated last year
icip-cas / Verifier-Engineering
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
☆63 · Dec 5, 2024 · Updated last year
OSU-NLP-Group / LLM-Knowledge-Conflict
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
☆84 · Apr 12, 2024 · Updated 2 years ago
GAIR-NLP / O1-Journey
O1 Replication Journey
☆2,000 · Jan 14, 2025 · Updated last year
tatsu-lab / test_set_contamination
☆43 · Nov 7, 2023 · Updated 2 years ago
lil-lab / icrl
☆33 · Feb 10, 2025 · Updated last year
ur-whitelab / alcfd
Active learning symbolic regression CFD + AI = Wow
☆17 · Apr 21, 2022 · Updated 4 years ago
maitrix-org / llm-reasoners
A library for advanced large language model reasoning
☆2,339 · Jun 10, 2025 · Updated last year
IIEKES / MLM_transfer
☆17 · Oct 9, 2022 · Updated 3 years ago
UCSB-NLP-Chang / PromptBoosting
☆17 · Sep 5, 2023 · Updated 2 years ago
MARIO-Math-Reasoning / Super_MARIO
☆341 · Jun 5, 2025 · Updated last year
causalNLP / AI-Scholar
☆23 · Dec 8, 2022 · Updated 3 years ago
xuchennlp / S2T
The project for speech translation
☆12 · Sep 28, 2023 · Updated 2 years ago
kevinscaria / TarGEN
Targeted Data Generation with Large Language Models
☆19 · Jun 25, 2024 · Updated 2 years ago
GAIR-NLP / ProX
[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale
☆271 · Jul 8, 2025 · Updated last year
OpenRLHF / OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,828 · Jul 14, 2026 · Updated last week
thunlp / SememeWSD
Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"
☆14 · Dec 2, 2020 · Updated 5 years ago
lqtrung1998 / mwp_ReFT
☆554 · Jan 2, 2025 · Updated last year
Open-Source-O1 / Open-O1
☆1,340 · Nov 21, 2024 · Updated last year