☆97Dec 16, 2024Updated last year
Alternatives and similar repositories for mcts-llm
Users that are interested in mcts-llm are comparing it to the libraries listed below
Sorting:
- Monte Carlo Tree Search Self-Refine (MCTSr)☆22Jul 6, 2024Updated last year
- ☆130Jun 18, 2024Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆54Jun 6, 2025Updated 9 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆694Jan 20, 2025Updated last year
- ☆16Oct 5, 2022Updated 3 years ago
- ☆31Oct 2, 2024Updated last year
- HRED VHRED VHCR for Multi-Turn Dialogue Systems☆43Dec 16, 2019Updated 6 years ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,837Jan 17, 2025Updated last year
- pip install continualcode☆35Feb 10, 2026Updated last month
- Dev and Test Data of LogicGame benchmark☆19Mar 31, 2025Updated 11 months ago
- Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation"☆18Aug 31, 2019Updated 6 years ago
- Active learning symbolic regression CFD + AI = Wow☆17Apr 21, 2022Updated 3 years ago
- ☆968Jan 23, 2025Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆591Dec 9, 2024Updated last year
- Large Reasoning Models☆807Dec 3, 2024Updated last year
- 本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。☆15Mar 9, 2021Updated 5 years ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…☆300Nov 16, 2024Updated last year
- Dataset Pinocchio for paper "Towards Understanding Factual Knowledge of Large Language Models" accepted by ICLR 2024 (Spotlight)☆12Mar 13, 2024Updated 2 years ago
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆20Dec 4, 2024Updated last year
- ☆11Apr 4, 2018Updated 7 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆80Apr 12, 2024Updated last year
- ☆42Nov 7, 2023Updated 2 years ago
- ☆48Feb 26, 2025Updated last year
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆63Dec 5, 2024Updated last year
- O1 Replication Journey☆1,999Jan 14, 2025Updated last year
- A library for advanced large language model reasoning☆2,338Jun 10, 2025Updated 9 months ago
- ☆342Jun 5, 2025Updated 9 months ago
- ☆23Dec 8, 2022Updated 3 years ago
- ☆17Oct 9, 2022Updated 3 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- ☆16Sep 5, 2023Updated 2 years ago
- ☆30Feb 10, 2025Updated last year
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)☆9,191Updated this week
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆267Jul 8, 2025Updated 8 months ago
- Targeted Data Generation with Large Language Models☆19Jun 25, 2024Updated last year
- ☆553Jan 2, 2025Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Jun 11, 2025Updated 9 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆62Feb 22, 2026Updated 3 weeks ago
- ☆1,347Nov 21, 2024Updated last year