FranxYao / GPT-Bargaining
Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback
☆204Updated last year
Alternatives and similar repositories for GPT-Bargaining:
Users that are interested in GPT-Bargaining are comparing it to the libraries listed below
- Simple next-token-prediction for RLHF☆222Updated last year
- ☆160Updated last year
- Self-Alignment with Principle-Following Reward Models☆154Updated last year
- FireAct: Toward Language Agent Fine-tuning☆270Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆218Updated last year
- ☆172Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 3 months ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆226Updated last year
- ☆231Updated 2 years ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆157Updated 9 months ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- A set of utilities for running few-shot prompting experiments on large-language models☆118Updated last year
- Source code for the paper "Active Prompting with Chain-of-Thought for Large Language Models"☆233Updated 9 months ago
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆279Updated 2 weeks ago
- Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".☆149Updated last year
- Code accompanying the paper Pretraining Language Models with Human Preferences☆180Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆108Updated last year
- [ACL 2024] Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View☆111Updated 9 months ago
- ☆175Updated last month
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆94Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆69Updated last year
- Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Langu…☆348Updated last year
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆307Updated 5 months ago
- A framework for human-readable prompt-based method with large language models. Specially designed for researchers. (Deprecated, check out…☆129Updated 2 years ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…☆113Updated 8 months ago
- A repository for transformer critique learning and generation☆88Updated last year
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆282Updated 9 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆94Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆112Updated last year