yale-nlp / MCTS-RAG
☆47Updated 3 months ago
Alternatives and similar repositories for MCTS-RAG
Users who are interested in MCTS-RAG are comparing it to the libraries listed below.
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- PGRAG☆48Updated 10 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆57Updated 3 weeks ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆54Updated 3 months ago
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆32Updated last month
- ☆80Updated 2 weeks ago
- This is the code of MMOA-RAG.☆53Updated 3 weeks ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA…☆23Updated last year
- A Comprehensive Library for Memory of LLM-based Agents.☆36Updated 3 weeks ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆94Updated 3 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆55Updated last week
- The official code of paper “Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning”☆99Updated this week
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆102Updated 4 months ago
- Efficient Agent Training for Computer Use☆94Updated last week
- ☆47Updated 5 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆151Updated last month
- ☆102Updated 5 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆102Updated 7 months ago
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆73Updated 7 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆75Updated last week
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆36Updated 3 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆111Updated 2 months ago
- ☆97Updated 3 months ago
- ☆31Updated 6 months ago
- ☆94Updated 5 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆97Updated 3 months ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆52Updated 2 weeks ago
- ☆36Updated 4 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆59Updated last month
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆62Updated 5 months ago