FranxYao/GPT-Bargaining

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FranxYao/GPT-Bargaining)

FranxYao / GPT-Bargaining

Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback

☆207

Alternatives and similar repositories for GPT-Bargaining

Users that are interested in GPT-Bargaining are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

agi-templar / Stable-Alignment
View on GitHub
Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Langu…
☆356Jun 18, 2023Updated 3 years ago
HITsz-TMG / ICL-State-Vector
View on GitHub
☆12Jul 4, 2024Updated 2 years ago
Zce1112zslx / IKE
View on GitHub
☆41Nov 30, 2023Updated 2 years ago
jlin816 / dialop
View on GitHub
DialOp: Decision-oriented dialogue environments for collaborative language agents
☆114Nov 15, 2024Updated last year
THUNLP-MT / CODIS
View on GitHub
Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".
☆13Oct 14, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Farama-Foundation / ChatArena
View on GitHub
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration ca…
☆1,551Aug 11, 2025Updated 11 months ago
FranxYao / chain-of-thought-hub
View on GitHub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
☆2,776Aug 4, 2024Updated last year
HazyResearch / TART
View on GitHub
TART: A plug-and-play Transformer module for task-agnostic reasoning
☆201Jun 22, 2023Updated 3 years ago
wenhuchen / TheoremQA
View on GitHub
The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset
☆161Apr 23, 2024Updated 2 years ago
sail-sg / symbolic-instruction-tuning
View on GitHub
The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".
☆65Apr 18, 2023Updated 3 years ago
MikeWangWZHL / Solo-Performance-Prompting
View on GitHub
Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"
☆352May 8, 2024Updated 2 years ago
HazyResearch / embroid
View on GitHub
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11Aug 12, 2023Updated 2 years ago
composable-models / llm_multiagent_debate
View on GitHub
ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate
☆544Apr 24, 2025Updated last year
vicgalle / zero-shot-reward-models
View on GitHub
ZYN: Zero-Shot Reward Models with Yes-No Questions
☆34Aug 15, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
snu-mllab / Context-Memory
View on GitHub
Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)
☆63Apr 18, 2024Updated 2 years ago
feyzaakyurek / rl4f
View on GitHub
Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.
☆63Nov 27, 2024Updated last year
THU-KEG / EvaluationPapers4ChatGPT
View on GitHub
Resource, Evaluation and Detection Papers for ChatGPT
☆456Mar 21, 2024Updated 2 years ago
swarnaHub / ExplanationIntervention
View on GitHub
[NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind
☆66Dec 21, 2023Updated 2 years ago
google-research / r_u_sure
View on GitHub
Code accompanying the paper "R-U-SURE? Uncertainty-Aware Code Suggestions By Maximizing Utility Across Random User Intents"
☆23Jul 8, 2026Updated 2 weeks ago
JetRunner / SuperICL
View on GitHub
Code for "Small Models are Valuable Plug-ins for Large Language Models"
☆131May 16, 2023Updated 3 years ago
yinzhangyue / EoT
View on GitHub
Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication
☆21Mar 21, 2024Updated 2 years ago
princeton-nlp / PTP
View on GitHub
Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073
☆32Jul 9, 2024Updated 2 years ago
thunlp / UltraChat
View on GitHub
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
☆2,875Mar 13, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tianjunz / HIR
View on GitHub
☆157Mar 18, 2023Updated 3 years ago
thu-coai / PICL
View on GitHub
Code for ACL2023 paper: Pre-Training to Learn in Context
☆106Jul 26, 2024Updated 2 years ago
Shawn-Guo-CN / Lossless_Text_Compression_with_Transformer
View on GitHub
This repo is to demo the concept of lossless compression with Transformers as encoder and decoder.
☆14May 2, 2024Updated 2 years ago
OFA-Sys / gsm8k-ScRel
View on GitHub
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
☆269Sep 12, 2024Updated last year
nightdessert / Retrieval_Head
View on GitHub
open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality
☆241Aug 2, 2024Updated last year
princeton-nlp / ALCE
View on GitHub
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
☆523Oct 9, 2024Updated last year
neelsjain / BYOD
View on GitHub
The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"
☆108Sep 23, 2023Updated 2 years ago
IBM / SALMON
View on GitHub
Self-Alignment with Principle-Following Reward Models
☆170Sep 18, 2025Updated 10 months ago
chujiezheng / LLM-Extrapolation
View on GitHub
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆75May 20, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
martin-wey / CodeUltraFeedback
View on GitHub
CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)
☆76Jun 25, 2024Updated 2 years ago
chuanyang-Zheng / Progressive-Hint
View on GitHub
This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"
☆208Oct 11, 2023Updated 2 years ago
THUDM / AgentBench
View on GitHub
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
☆3,603Feb 8, 2026Updated 5 months ago
madaan / self-refine
View on GitHub
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
☆814Oct 4, 2024Updated last year
mandyyyyii / scibench
View on GitHub
☆132Jul 8, 2024Updated 2 years ago
anchen1011 / FireAct
View on GitHub
FireAct: Toward Language Agent Fine-tuning
☆296Oct 22, 2023Updated 2 years ago
facebookresearch / Shepherd
View on GitHub
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆224Aug 10, 2023Updated 2 years ago