Open-DataFlow / RARE
Official implementation of RARE: Retrieval-Augmented Reasoning Modeling
☆171Updated this week
Alternatives and similar repositories for RARE
Users who are interested in RARE are comparing it to the libraries listed below
- SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL☆106Updated last week
- [EMNLP 2024] DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models☆70Updated 3 weeks ago
- Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models☆176Updated 6 months ago
- The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://a…☆301Updated 6 months ago
- When Agent Becomes the Scientist – Building Closed-Loop System from Hypothesis to Verification☆197Updated 2 weeks ago
- MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a…☆176Updated last month
- ☆130Updated 2 months ago
- [ACL2024 Findings] Knowledge-to-SQL: Enhancing SQL Generation with Data Expert LLM☆57Updated 2 months ago
- [ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"☆145Updated 2 weeks ago
- SurveyForge: On the Outline Heuristics, Memory-Driven Generation, and Multi-dimensional Evaluation for Automated Survey Writing☆143Updated 2 months ago
- A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]☆117Updated 2 weeks ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆168Updated 5 months ago
- Official Repository for Paper: The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning☆49Updated last month
- Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning☆70Updated last month
- We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …☆93Updated last year
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework☆226Updated 2 months ago
- ☆45Updated 2 months ago
- The repository for the paper titled "Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks"☆155Updated 5 months ago
- R1-like Computer-use Agent☆73Updated 2 months ago
- A general AI agent framework that can be adapted to various tasks and environments.☆100Updated 3 months ago
- ☆38Updated this week
- ☆60Updated 2 months ago
- This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.☆60Updated 7 months ago
- AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning (NeurIPS 2024)☆200Updated last month
- A clean and extensible agentic RAG system with modular implementation.☆99Updated last month
- A collection of papers related to knowledge fusion☆56Updated 7 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆58Updated 2 months ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆136Updated 2 months ago
- [ACL 25 main] Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model☆34Updated 2 weeks ago
- (NeurIPS 2024) Official PyTorch implementation of LOVA3☆85Updated 2 months ago