rmshin/llm-mcts

rmshin / llm-mcts

☆40

Alternatives and similar repositories for llm-mcts

Users who are interested in llm-mcts are comparing it to the libraries listed below.

shunzh / mcts-for-llm
This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.
☆16 · Jun 28, 2024 · Updated 2 years ago
cavaunpeu / mcts-llm-codegen
A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)
☆17 · Dec 1, 2023 · Updated 2 years ago
rohinmanvi / Capability-Aware-and-Mid-Generation-Self-Evaluations
☆21 · Jul 25, 2025 · Updated last year
scandukuri / assistant-gate
☆28 · May 29, 2024 · Updated 2 years ago
google-deepmind / icml2024-roundtrip-correctness
☆17 · Jun 18, 2024 · Updated 2 years ago
wangruicn / DialogueCSE
DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings
☆19 · Nov 24, 2021 · Updated 4 years ago
YuxiXie / MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
☆331 · Jan 29, 2026 · Updated 6 months ago
patched-codes / semgrep-rules
A collection of permissively licensed Semgrep rules.
☆25 · Jul 5, 2024 · Updated 2 years ago
DeepSoftwareAnalytics / Telly
Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond
☆23 · Apr 9, 2023 · Updated 3 years ago
google-research-datasets / QuoteSum
QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …
☆13 · Mar 25, 2024 · Updated 2 years ago
tianjunz / HIR
☆157 · Mar 18, 2023 · Updated 3 years ago
microsoft / prose-benchmarks
PROSE Public Benchmark Suite
☆35 · Sep 15, 2025 · Updated 10 months ago
securade / sentinel
Securade.ai Sentinel - A monitoring and surveillance application that enables visual Q&A and video captioning for existing CCTV cameras.
☆31 · Apr 6, 2025 · Updated last year
nirgreshler / bayesian-online-planning
The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.
☆13 · Jun 17, 2024 · Updated 2 years ago
1989Ryan / llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…
☆303 · Nov 16, 2024 · Updated last year
abaheti95 / QADialogSystem
We design models that generate conversational responses for factual questions using expert answer phrases from Question Answering systems…
☆21 · Jul 2, 2020 · Updated 6 years ago
yudasong / briee
Representation Learning in RL
☆13 · Jun 1, 2022 · Updated 4 years ago
allenai / hyper-task-descriptions
Learning adapter weights from task descriptions
☆20 · Nov 12, 2023 · Updated 2 years ago
shunzh / Code-AI-Tree-Search
☆118 · Jul 17, 2024 · Updated 2 years ago
rmlarose / qcbq
Quantum computing bootcamp with Qiskit
☆13 · Jul 6, 2023 · Updated 3 years ago
Zhaoyilunnn / q-gpu
moved to https://github.com/Zhaoyilunnn/qdao
☆10 · Aug 30, 2023 · Updated 2 years ago
ASSERT-KTH / SelfAPR
repo of "SelfAPR: Self-supervised Program Repair with Test Execution Diagnostics" (ASE 22) https://oadoi.org/10.1145/3551349.3556926
☆29 · Mar 4, 2024 · Updated 2 years ago
bkiani / Beyond-Barren-Plateaus
☆12 · Jun 5, 2023 · Updated 3 years ago
writer / writing-in-the-margins
☆121 · Mar 18, 2026 · Updated 4 months ago
THUDM / ReST-MCTS
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
☆709 · Jan 20, 2025 · Updated last year
random-matrix-learning / slides
LaTeX source code for the slides
☆24 · Jul 15, 2021 · Updated 5 years ago
Victorwz / LaViA
☆10 · Jul 13, 2024 · Updated 2 years ago
csitfun / ConTRoL-dataset
Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"
☆11 · Nov 18, 2022 · Updated 3 years ago
trotsky1997 / MathBlackBox
☆1,033 · Dec 17, 2024 · Updated last year
NTU-SQUAD / transformers-coqa
Albert for Conversational Question Answering Challenge
☆21 · Jun 12, 2023 · Updated 3 years ago
Feng-Jay / GiantRepair
Artifact for TOSEM Submission: GiantRepair
☆12 · Jun 26, 2024 · Updated 2 years ago
CRIPAC-DIG / SCGAN
[ICME 2019] Source code and datasets for "Semi-supervised Compatibility Learning Across Categories for Clothing Matching"
☆11 · Apr 26, 2024 · Updated 2 years ago
fangyuan-ksgk / CoT-Reasoning-without-Prompting
Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting
☆35 · Mar 19, 2024 · Updated 2 years ago
init0xyz / AdaCQR
Implementation of AdaCQR(COLING 2025)
☆15 · Dec 30, 2024 · Updated last year
yinzhangyue / EoT
Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication
☆21 · Mar 21, 2024 · Updated 2 years ago
lamda-bbo / mcts-transfer
Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".
☆13 · Nov 28, 2024 · Updated last year
cjyaras / monarch-attention
MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention (NeurIPS'25 Spotlight)
☆26 · Feb 22, 2026 · Updated 5 months ago
rmlarose / QuIC-Seminar
Code repository for the QuIC Seminar at Michigan State University.
☆15 · Dec 4, 2019 · Updated 6 years ago
jordddan / GameEval
Using conversational games to evaluate powerful LLMs
☆18 · Sep 3, 2023 · Updated 2 years ago