instadeepai / DebateLLM
Benchmarking Multi-Agent Debate between Language Models for Truthfulness in Q&A.
☆43 Updated last year
Alternatives and similar repositories for DebateLLM
Users that are interested in DebateLLM are comparing it to the libraries listed below
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D ☆466 Updated 10 months ago
- This is the repository for the Tool Learning survey. ☆457 Updated 3 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆148 Updated last year
- Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate" ☆308 Updated last year
- Awesome papers for role-playing with language models ☆210 Updated last year
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation". ☆249 Updated last year
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization ☆181 Updated last year
- ☆427 Updated 4 months ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language … ☆143 Updated 6 months ago
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23 ☆239 Updated last year
- ☆107 Updated last year
- Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral) ☆178 Updated 3 weeks ago
- LLM hallucination paper list ☆327 Updated last year
- Official repository for RAG-Gym ☆116 Updated 9 months ago
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future ☆470 Updated 10 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning ☆231 Updated 10 months ago
- [NeurIPS 2023] Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting" ☆111 Updated 2 years ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation" ☆194 Updated last year
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024) ☆160 Updated 6 months ago
- The official repo for our paper: LegalAgentBench: Evaluating LLM Agents in Legal Domain ☆33 Updated 11 months ago
- ☆162 Updated 10 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆132 Updated 9 months ago
- ☆78 Updated last year
- [ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models ☆64 Updated 8 months ago
- Generative Judge for Evaluating Alignment ☆248 Updated last year
- [NeurIPS 2024] Agent Planning with World Knowledge Model ☆156 Updated 11 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench. ☆198 Updated 7 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆135 Updated 9 months ago
- Data and Code for Program of Thoughts [TMLR 2023] ☆292 Updated last year
- Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering ☆62 Updated 9 months ago