☆97Dec 16, 2024Updated last year
Alternatives and similar repositories for mcts-llm
Users that are interested in mcts-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Jul 21, 2024Updated last year
- Toy implementation of Strawberry☆33Sep 24, 2024Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆54Jun 6, 2025Updated 11 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆704Jan 20, 2025Updated last year
- ☆1,035Dec 17, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆32Oct 2, 2024Updated last year
- HRED VHRED VHCR for Multi-Turn Dialogue Systems☆43Dec 16, 2019Updated 6 years ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,842Jan 17, 2025Updated last year
- pip install continualcode☆41Feb 10, 2026Updated 3 months ago
- Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation"☆18Aug 31, 2019Updated 6 years ago
- Dev and Test Data of LogicGame benchmark☆19Mar 31, 2025Updated last year
- Automated neural architecture search algorithms implemented in PyTorch and Autogluon toolkit.☆12Apr 17, 2020Updated 6 years ago
- Active learning symbolic regression CFD + AI = Wow☆17Apr 21, 2022Updated 4 years ago
- ☆970Jan 23, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆596Dec 9, 2024Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆100Oct 3, 2025Updated 7 months ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆39Jan 16, 2026Updated 4 months ago
- 本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。☆15Mar 9, 2021Updated 5 years ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆301Nov 16, 2024Updated last year
- Dataset Pinocchio for paper "Towards Understanding Factual Knowledge of Large Language Models" accepted by ICLR 2024 (Spotlight)☆12Mar 13, 2024Updated 2 years ago
- ☆10Oct 20, 2020Updated 5 years ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆20Dec 4, 2024Updated last year
- ☆11Apr 4, 2018Updated 8 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆80Apr 12, 2024Updated 2 years ago
- ☆48Feb 26, 2025Updated last year
- O1 Replication Journey☆2,000Jan 14, 2025Updated last year
- A library for advanced large language model reasoning☆2,343Jun 10, 2025Updated 11 months ago
- ☆341Jun 5, 2025Updated 11 months ago
- ☆10Oct 14, 2023Updated 2 years ago
- ☆23Dec 8, 2022Updated 3 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆16Sep 5, 2023Updated 2 years ago
- ☆16Oct 16, 2023Updated 2 years ago
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…☆9,523Updated this week
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆269Jul 8, 2025Updated 10 months ago
- A repository to get acquainted with basic training tasks in natural language processing and machine learning☆11Dec 27, 2023Updated 2 years ago
- Targeted Data Generation with Large Language Models☆19Jun 25, 2024Updated last year
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"☆14Dec 2, 2020Updated 5 years ago