☆98Dec 16, 2024Updated last year
Alternatives and similar repositories for mcts-llm
Users that are interested in mcts-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Monte Carlo Tree Search Self-Refine (MCTSr)☆22Jul 6, 2024Updated last year
- ☆11Jul 21, 2024Updated last year
- ☆130Jun 18, 2024Updated 2 years ago
- Toy implementation of Strawberry☆33Sep 24, 2024Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆55Jun 6, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆706Jan 20, 2025Updated last year
- ☆1,034Dec 17, 2024Updated last year
- ☆16Oct 5, 2022Updated 3 years ago
- Resources regarding evML (edge verified machine learning)☆23Jan 4, 2025Updated last year
- HRED VHRED VHCR for Multi-Turn Dialogue Systems☆43Dec 16, 2019Updated 6 years ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,848Jan 17, 2025Updated last year
- pip install continualcode☆43Feb 10, 2026Updated 4 months ago
- Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation"☆18Aug 31, 2019Updated 6 years ago
- Active learning symbolic regression CFD + AI = Wow☆17Apr 21, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆972Jan 23, 2025Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆597Dec 9, 2024Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆103Oct 3, 2025Updated 8 months ago
- Large Reasoning Models☆803Dec 3, 2024Updated last year
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆40Jan 16, 2026Updated 5 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆303Nov 16, 2024Updated last year
- Dataset Pinocchio for paper "Towards Understanding Factual Knowledge of Large Language Models" accepted by ICLR 2024 (Spotlight)☆12Mar 13, 2024Updated 2 years ago
- An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.☆260Jun 22, 2026Updated last week
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆20Dec 4, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Apr 4, 2018Updated 8 years ago
- ☆42Nov 7, 2023Updated 2 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆83Apr 12, 2024Updated 2 years ago
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆63Dec 5, 2024Updated last year
- ☆48Feb 26, 2025Updated last year
- O1 Replication Journey☆2,001Jan 14, 2025Updated last year
- ☆28Apr 14, 2025Updated last year
- ☆10Oct 14, 2023Updated 2 years ago
- ☆23Dec 8, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17Sep 5, 2023Updated 2 years ago
- ☆33Feb 10, 2025Updated last year
- ☆16Oct 16, 2023Updated 2 years ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…☆9,710Jun 17, 2026Updated 2 weeks ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆270Jul 8, 2025Updated 11 months ago
- A repository to get acquainted with basic training tasks in natural language processing and machine learning☆11Dec 27, 2023Updated 2 years ago
- Targeted Data Generation with Large Language Models☆19Jun 25, 2024Updated 2 years ago