FranxYao / GPT-Bargaining
Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback
☆206Updated last year
Alternatives and similar repositories for GPT-Bargaining:
Users that are interested in GPT-Bargaining are comparing it to the libraries listed below
- Simple next-token-prediction for RLHF☆225Updated last year
- FireAct: Toward Language Agent Fine-tuning☆275Updated last year
- ☆159Updated 2 years ago
- About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…☆70Updated last year
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆333Updated 7 months ago
- Self-Alignment with Principle-Following Reward Models☆160Updated last year
- ☆172Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆158Updated 11 months ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 5 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- ☆179Updated 2 months ago
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"☆328Updated 11 months ago
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆94Updated last year
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆308Updated 11 months ago
- Inspecting and Editing Knowledge Representations in Language Models☆115Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆218Updated last year
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆44Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Reasoning with Language Model is Planning with World Model☆164Updated last year
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆124Updated 9 months ago
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆115Updated 11 months ago
- Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Langu…☆347Updated last year
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆284Updated 2 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆135Updated 5 months ago
- Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".☆152Updated last year
- A repository for transformer critique learning and generation☆90Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆134Updated 5 months ago
- ToolBench, an evaluation suite for LLM tool manipulation capabilities.☆150Updated last year
- ☆132Updated last year
- augmented LLM with self reflection☆119Updated last year