facebookresearch / diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
☆1,325Updated last year
Alternatives and similar repositories for diplomacy_cicero:
Users that are interested in diplomacy_cicero are comparing it to the libraries listed below
- ☆998Updated last year
- Monte Carlo tree search in JAX☆2,436Updated 3 months ago
- ☆524Updated last year
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆829Updated 4 months ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆821Updated 2 years ago
- ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration ca…☆1,425Updated 9 months ago
- A prize for finding tasks that cause large language models to show inverse scaling☆609Updated last year
- Code for "Learning to summarize from human feedback"☆1,014Updated last year
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆515Updated last month
- Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.☆689Updated this week
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)☆3,883Updated 9 months ago
- ☆1,543Updated last year
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".☆1,310Updated last year
- Language Modeling with the H3 State Space Model☆515Updated last year
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,596Updated last year
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …☆632Updated last year
- Mastering Diverse Domains through World Models☆1,536Updated 2 weeks ago
- Code for Parsel 🐍 - generate complex programs with language models☆426Updated last year
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆489Updated last month
- Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.☆1,537Updated 3 weeks ago
- A suite of test scenarios for multi-agent reinforcement learning.☆657Updated this week
- Evolution Through Large Models☆713Updated last year
- Creative interactive views of any dataset.☆837Updated 2 months ago
- A platform for managing machine learning experiments☆839Updated 3 weeks ago
- Ask Me Anything language model prompting☆545Updated last year
- Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos☆1,422Updated 9 months ago
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆799Updated 8 months ago
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,296Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆306Updated 2 years ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,367Updated 11 months ago