NumberChiffre / mcts-llm
☆87Updated last month
Alternatives and similar repositories for mcts-llm:
Users who are interested in mcts-llm are comparing it to the libraries listed below
- Code implementation of synthetic continued pretraining☆79Updated last week
- ☆105Updated 6 months ago
- ☆98Updated last month
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆64Updated last month
- Reformatted Alignment☆113Updated 3 months ago
- Official Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆208Updated 3 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆109Updated 2 months ago
- ☆136Updated 6 months ago
- ☆48Updated 10 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆88Updated 3 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆102Updated 6 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆48Updated 3 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆136Updated 6 months ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆40Updated 2 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆171Updated 3 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆129Updated 2 months ago
- ☆40Updated last month
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆98Updated last month
- Collection of papers for scalable automated alignment.☆82Updated 2 months ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆37Updated 5 months ago
- ☆81Updated 8 months ago
- ☆119Updated last month
- ☆62Updated 3 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆58Updated 2 months ago
- Official GitHub repo for AutoDetect, an automated weakness detection framework for LLMs.☆40Updated 6 months ago
- ☆247Updated 5 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆272Updated 5 months ago
- ☆49Updated last month
- ☆92Updated 9 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆42Updated 6 months ago