DIRECT-BIT / SRA-MCTS
☆28Updated 3 months ago
Alternatives and similar repositories for SRA-MCTS:
Users that are interested in SRA-MCTS are comparing it to the libraries listed below
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆84Updated last month
- ☆15Updated 8 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆51Updated 3 months ago
- This the implementation of LeCo☆32Updated 2 months ago
- Reformatted Alignment☆115Updated 6 months ago
- The official repository of the Omni-MATH benchmark.☆77Updated 3 months ago
- ☆49Updated last year
- The code of arxiv paper: "CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis"☆23Updated 2 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆60Updated 4 months ago
- ☆59Updated 6 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆62Updated 3 weeks ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆52Updated 9 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆45Updated 2 months ago
- ☆44Updated 3 months ago
- ☆52Updated 5 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆30Updated 9 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆47Updated 9 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆66Updated 2 months ago
- ☆60Updated 3 months ago
- A Comprehensive Survey on Long Context Language Modeling☆86Updated last week
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆56Updated 5 months ago
- ☆34Updated 4 months ago
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…☆30Updated 3 months ago
- Knowledge-Reasoning Synergy Reinforcement Learning.☆31Updated 3 weeks ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆79Updated last year
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆73Updated 9 months ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆47Updated 4 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆101Updated this week
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆25Updated last year
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆55Updated 3 months ago