QiushiSun / CorexLinks
[COLM'24] Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
☆29Updated 9 months ago
Alternatives and similar repositories for Corex
Users that are interested in Corex are comparing it to the libraries listed below
Sorting:
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆229Updated 6 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆57Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆130Updated 10 months ago
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆31Updated last year
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆69Updated 7 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆158Updated last year
- The demo, code and data of FollowRAG☆74Updated last month
- ☆23Updated 2 years ago
- ☆67Updated last month
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆147Updated 9 months ago
- ☆33Updated 2 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆61Updated 9 months ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆139Updated 9 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆122Updated 4 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆68Updated 2 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆42Updated 5 months ago
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆418Updated 6 months ago
- [EMNLP 2023] Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts☆27Updated last year
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆127Updated 4 months ago
- ☆19Updated last year
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)☆140Updated 2 months ago
- Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral)☆155Updated 5 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆80Updated 6 months ago
- [ACL 2025] A Neural-Symbolic Self-Training Framework☆110Updated 2 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆61Updated 6 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆46Updated 2 months ago
- ☆48Updated 9 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆147Updated 7 months ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆118Updated 2 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated last year