instadeepai / DebateLLMLinks
Benchmarking Multi-Agent Debate between Language Models for Truthfulness in Q&A.
☆34Updated last year
Alternatives and similar repositories for DebateLLM
Users that are interested in DebateLLM are comparing it to the libraries listed below
Sorting:
- ☆89Updated 7 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆123Updated 8 months ago
- A framework for editing the CoTs for better factuality☆49Updated last year
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆56Updated last year
- This is the code of MMOA-RAG.☆53Updated 3 weeks ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆55Updated 3 weeks ago
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆87Updated last year
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆13Updated 7 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆61Updated 4 months ago
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024)☆67Updated last month
- [ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆63Updated 7 months ago
- [ACL 2024] On the Multi-turn Instruction Following for Conversational Web Agents☆16Updated 7 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆97Updated 4 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆48Updated 11 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆57Updated 7 months ago
- We have released the code and demo program required for LLM with self-verification☆60Updated last year
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆94Updated 3 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆69Updated 10 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆123Updated 2 months ago
- ☆59Updated last week
- Code implementation of synthetic continued pretraining☆111Updated 5 months ago
- The official code of paper “Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning”☆117Updated this week
- A trainable user simulator☆34Updated 8 months ago
- ☆42Updated 7 months ago
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models☆97Updated 9 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆61Updated this week
- ☆144Updated 4 months ago
- ☆60Updated 2 weeks ago
- ☆52Updated 8 months ago
- Official repository for RAG-Gym☆80Updated 3 months ago