FranxYao / GPT-Bargaining
Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback
☆201 Updated last year
Related projects
Alternatives and complementary repositories for GPT-Bargaining
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents ☆276 Updated 2 months ago
- A set of utilities for running few-shot prompting experiments on large-language models ☆113 Updated last year
- FireAct: Toward Language Agent Fine-tuning ☆255 Updated last year
- ☆158 Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆213 Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents ☆101 Updated this week
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought" ☆92 Updated 10 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning". ☆155 Updated 6 months ago
- Chain-of-Hindsight, A Scalable RLHF Method ☆220 Updated last year
- Reasoning with Language Model is Planning with World Model ☆145 Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆87 Updated last year
- ☆170 Updated last year
- A paper collection of methods that use language to interact with environments, including the real world, simulated worlds, or WWW… ☆123 Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆49 Updated 8 months ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha… ☆104 Updated 5 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents ☆250 Updated 6 months ago
- Self-Alignment with Principle-Following Reward Models ☆148 Updated 8 months ago
- A repository for transformer critique learning and generation ☆86 Updated 11 months ago
- Code accompanying the paper Pretraining Language Models with Human Preferences ☆177 Updated 9 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight) ☆164 Updated this week
- ToolBench, an evaluation suite for LLM tool manipulation capabilities. ☆144 Updated 8 months ago
- ☆259 Updated 11 months ago
- ☆171 Updated 6 months ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View ☆102 Updated 6 months ago
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate ☆353 Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆124 Updated 3 weeks ago
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467 ☆266 Updated last year
- A large-scale, fine-grained, diverse preference dataset (and models). ☆315 Updated 10 months ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels … ☆240 Updated last year
- Data and Code for Program of Thoughts (TMLR 2023) ☆243 Updated 6 months ago