facebookresearch / diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
☆1,397 Updated 7 months ago
Alternatives and similar repositories for diplomacy_cicero
Users who are interested in diplomacy_cicero are comparing it to the libraries listed below.
- ☆548 Updated last year
- Monte Carlo tree search in JAX ☆2,563 Updated 2 months ago
- ☆1,056 Updated last year
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. ☆846 Updated last year
- ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration ca… ☆1,516 Updated 3 months ago
- Evolution Through Large Models ☆735 Updated 2 years ago
- Building Open-Ended Embodied Agents with Internet-Scale Knowledge ☆2,064 Updated last year
- ☆1,249 Updated 2 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆916 Updated last year
- Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments ☆1,121 Updated last year
- ☆723 Updated 2 years ago
- The NetHack Learning Environment ☆970 Updated last year
- Code for "Learning to summarize from human feedback" ☆1,054 Updated 2 years ago
- A suite of test scenarios for multi-agent reinforcement learning. ☆758 Updated 3 weeks ago
- ☆510 Updated 3 years ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research ☆504 Updated 9 months ago
- Code for Parsel 🐍 - generate complex programs with language models ☆433 Updated 2 years ago
- A prize for finding tasks that cause large language models to show inverse scaling ☆619 Updated 2 years ago
- Cramming the training of a (BERT-type) language model into limited compute. ☆1,352 Updated last year
- Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos ☆1,581 Updated 2 months ago
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" ☆1,798 Updated 5 months ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways ☆826 Updated 3 years ago
- Dramatron uses large language models to generate coherent scripts and screenplays. ☆1,024 Updated last year
- Dromedary: towards helpful, ethical and reliable LLMs. ☆1,144 Updated 2 months ago
- ☆961 Updated last year
- Benchmarking the Spectrum of Agent Capabilities ☆489 Updated last year
- ☆538 Updated 2 years ago
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate … ☆637 Updated 2 years ago
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data. ☆837 Updated last year
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI ☆2,056 Updated last year