gauss5930 / LLM-Agora
LLM Agora, debating between open-source LLMs to refine the answers
☆54Updated last year
Alternatives and similar repositories for LLM-Agora:
Users who are interested in LLM-Agora are comparing it to the repositories listed below
- Augmented LLM with self-reflection☆109Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆95Updated 9 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OpenAI☆100Updated this week
- ☆93Updated 6 months ago
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate☆386Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆53Updated 10 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆111Updated 4 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆42Updated 11 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆106Updated last month
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆106Updated 8 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆98Updated last month
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆125Updated 8 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆97Updated 3 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆85Updated 11 months ago
- ☆81Updated last year
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards☆49Updated 8 months ago
- Implementation of the Quiet-STaR paper (https://arxiv.org/pdf/2403.09629.pdf)☆48Updated 5 months ago
- ☆81Updated this week
- ☆113Updated 2 months ago
- ☆135Updated 3 months ago
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆54Updated last week
- ☆89Updated this week
- Reasoning with Language Model is Planning with World Model☆154Updated last year
- A benchmark list for evaluation of large language models.☆76Updated 6 months ago
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning>☆48Updated last year
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆88Updated 3 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆55Updated 8 months ago
- ☆115Updated 3 months ago
- ☆110Updated 3 weeks ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆175Updated last month