facebookresearch / diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
☆1,392 · Updated 6 months ago
Alternatives and similar repositories for diplomacy_cicero
Users who are interested in diplomacy_cicero are comparing it to the repositories listed below.
- Evolution Through Large Models ☆734 · Updated last year
- ☆548 · Updated last year
- ☆1,052 · Updated last year
- Code for "Learning to summarize from human feedback" ☆1,051 · Updated 2 years ago
- ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration ca… ☆1,512 · Updated 2 months ago
- Cramming the training of a (BERT-type) language model into limited compute. ☆1,350 · Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways ☆824 · Updated 3 years ago
- Monte Carlo tree search in JAX ☆2,549 · Updated 2 months ago
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. ☆844 · Updated last year
- The NetHack Learning Environment ☆966 · Updated last year
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" ☆1,792 · Updated 4 months ago
- ☆508 · Updated 3 years ago
- Code for Parsel 🐍 - generate complex programs with language models ☆433 · Updated 2 years ago
- Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos ☆1,572 · Updated 2 months ago
- Building Open-Ended Embodied Agents with Internet-Scale Knowledge ☆2,046 · Updated last year
- A library for generative social simulation ☆1,059 · Updated last week
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆4,725 · Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data. ☆828 · Updated last year
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆913 · Updated last year
- Code for the paper Fine-Tuning Language Models from Human Preferences ☆1,374 · Updated 2 years ago
- ☆1,247 · Updated 2 years ago
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input" ☆1,063 · Updated last year
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate … ☆634 · Updated 2 years ago
- MiniWoB++: a web interaction benchmark for reinforcement learning ☆349 · Updated 6 months ago
- The hub for EleutherAI's work on interpretability and learning dynamics ☆2,661 · Updated 5 months ago
- A prize for finding tasks that cause large language models to show inverse scaling ☆615 · Updated 2 years ago
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur… ☆550 · Updated 9 months ago
- ☆305 · Updated last year
- A suite of test scenarios for multi-agent reinforcement learning. ☆750 · Updated last week
- Ask Me Anything language model prompting ☆545 · Updated 2 years ago