QiushiSun / CorexLinks
[COLM'24] Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
☆32Updated last year
Alternatives and similar repositories for Corex
Users that are interested in Corex are comparing it to the libraries listed below
Sorting:
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆232Updated 11 months ago
- ☆24Updated 2 years ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆71Updated 7 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆150Updated last year
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆143Updated last year
- The demo, code and data of FollowRAG☆75Updated 5 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆44Updated last year
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆64Updated last year
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆201Updated 8 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆129Updated 9 months ago
- Neural Code Intelligence Survey 2024; Reading lists and resources☆279Updated 4 months ago
- EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.☆136Updated 2 years ago
- The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domainl☆35Updated 11 months ago
- Collection of papers for scalable automated alignment.☆94Updated last year
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆167Updated last year
- Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral)☆182Updated 2 weeks ago
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆480Updated this week
- ☆69Updated 6 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆189Updated last year
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆294Updated last month
- Awesome papers for role-playing with language models☆215Updated last year
- ☆110Updated last year
- ☆189Updated last year
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆154Updated 11 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆159Updated last year
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆242Updated last year
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆78Updated last year
- Data and Code for Program of Thoughts [TMLR 2023]☆300Updated last year
- Generative Judge for Evaluating Alignment☆248Updated last year
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆134Updated 9 months ago