facebookresearch / diplomacy_ciceroLinks
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
☆1,389Updated 6 months ago
Alternatives and similar repositories for diplomacy_cicero
Users that are interested in diplomacy_cicero are comparing it to the libraries listed below
Sorting:
- Monte Carlo tree search in JAX☆2,544Updated last month
- Evolution Through Large Models☆732Updated last year
- ☆546Updated last year
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆842Updated last year
- ☆1,048Updated last year
- ☆1,243Updated 2 years ago
- Building Open-Ended Embodied Agents with Internet-Scale Knowledge☆2,030Updated last year
- Cramming the training of a (BERT-type) language model into limited compute.☆1,349Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆823Updated 2 years ago
- Code for "Learning to summarize from human feedback"☆1,048Updated 2 years ago
- ☆717Updated 2 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆912Updated last year
- ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration ca…☆1,505Updated 2 months ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,714Updated last year
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,788Updated 4 months ago
- A suite of test scenarios for multi-agent reinforcement learning.☆745Updated 2 weeks ago
- A modular RL library to fine-tune language models to human preferences☆2,362Updated last year
- A library for generative social simulation☆1,038Updated 2 weeks ago
- ☆504Updated 3 years ago
- Reflexion: an autonomous agent with dynamic memory and self-reflection☆388Updated last year
- The NetHack Learning Environment☆964Updated last year
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,368Updated 2 years ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.☆770Updated 11 months ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,010Updated last year
- Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos☆1,547Updated last month
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆825Updated last year
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆2,056Updated 2 years ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆502Updated 8 months ago
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI☆2,049Updated last year
- Model API for GALACTICA☆2,738Updated 2 years ago