facebookresearch / diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
☆1,405 Updated 9 months ago
Alternatives and similar repositories for diplomacy_cicero
Users interested in diplomacy_cicero are comparing it to the repositories listed below.
- Monte Carlo tree search in JAX ☆2,584 Updated 4 months ago
- Evolution Through Large Models ☆737 Updated 2 years ago
- ☆551 Updated last year
- ☆1,066 Updated last year
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. ☆863 Updated last year
- ☆1,257 Updated 3 years ago
- A suite of test scenarios for multi-agent reinforcement learning. ☆781 Updated 3 weeks ago
- Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments ☆1,134 Updated 2 years ago
- Model API for GALACTICA ☆2,739 Updated 2 years ago
- Cramming the training of a (BERT-type) language model into limited compute. ☆1,361 Updated last year
- ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration ca… ☆1,529 Updated 5 months ago
- Building Open-Ended Embodied Agents with Internet-Scale Knowledge ☆2,122 Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways ☆828 Updated 3 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆920 Updated 2 years ago
- A prize for finding tasks that cause large language models to show inverse scaling ☆621 Updated 2 years ago
- Code for Parsel 🐍 - generate complex programs with language models ☆439 Updated 2 years ago
- ☆532 Updated 3 years ago
- Code for "Learning to summarize from human feedback" ☆1,057 Updated 2 years ago
- A library for generative social simulation ☆1,155 Updated last week
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research ☆505 Updated 11 months ago
- Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos ☆1,634 Updated 4 months ago
- ☆728 Updated 2 years ago
- Code for the paper Fine-Tuning Language Models from Human Preferences ☆1,376 Updated 2 years ago
- Dramatron uses large language models to generate coherent scripts and screenplays. ☆1,053 Updated last year
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" ☆1,810 Updated 7 months ago
- ☆994 Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models. ☆308 Updated 2 years ago
- ☆328 Updated last year
- High throughput synchronous and asynchronous reinforcement learning ☆971 Updated 2 months ago
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data. ☆841 Updated last year