yale-nlp / MCTS-RAG
☆22Updated last month
Alternatives and similar repositories for MCTS-RAG:
Users that are interested in MCTS-RAG are comparing it to the libraries listed below
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆58Updated 5 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated last month
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆82Updated last month
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆47Updated last month
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…☆31Updated 4 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆66Updated 2 weeks ago
- ☆35Updated 2 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆43Updated 2 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆110Updated 3 weeks ago
- PGRAG☆48Updated 8 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆49Updated last month
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆88Updated last month
- This is the code of MMOA-RAG.☆47Updated 3 weeks ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆90Updated 5 months ago
- Knowledge Unlearning for Large Language Models☆25Updated last week
- ☆65Updated this week
- HyperGraphRAG: Retrieval-Augmented Generation with Hypergraph-Structured Knowledge Representation☆22Updated 2 weeks ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 3 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆99Updated 2 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆33Updated 2 months ago
- [EMNLP 2024] TRACE the Evidence: Constructing Knowledge-Grounded Reasoning Chains for Retrieval-Augmented Generation☆25Updated last week
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]☆56Updated 2 months ago
- [Preprint] An inference-time decoding strategy with adaptive foresight sampling☆88Updated 2 weeks ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆12Updated 5 months ago
- ☆89Updated 3 weeks ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆24Updated 5 months ago
- The code and data of DPA-RAG☆58Updated 2 months ago
- ☆76Updated 2 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆69Updated last month
- This the implementation of LeCo☆32Updated 2 months ago