facebookresearch / diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
☆1,387 Updated 5 months ago
Alternatives and similar repositories for diplomacy_cicero
Users who are interested in diplomacy_cicero compare it to the libraries listed below.
- Monte Carlo tree search in JAX ☆2,538 Updated 3 weeks ago
- Evolution Through Large Models ☆733 Updated last year
- ☆1,038 Updated last year
- ☆546 Updated last year
- ChatArena (or Chat Arena) is a set of Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration ca… ☆1,504 Updated last month
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%. ☆841 Updated 11 months ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021. ☆911 Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways ☆824 Updated 2 years ago
- ☆1,244 Updated 2 years ago
- Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos ☆1,538 Updated 3 weeks ago
- Dromedary: towards helpful, ethical and reliable LLMs. ☆1,146 Updated last week
- A library for generative social simulation ☆1,024 Updated last week
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research ☆503 Updated 7 months ago
- A neurosymbolic perspective on LLMs ☆1,598 Updated last week
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data. ☆826 Updated last year
- Building Open-Ended Embodied Agents with Internet-Scale Knowledge ☆2,019 Updated last year
- The NetHack Learning Environment ☆965 Updated last year
- Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments ☆1,102 Updated last year
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆4,712 Updated last year
- ☆501 Updated 3 years ago
- Code for the paper Fine-Tuning Language Models from Human Preferences ☆1,364 Updated 2 years ago
- Reflexion: an autonomous agent with dynamic memory and self-reflection ☆388 Updated last year
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback" ☆1,785 Updated 3 months ago
- A suite of test scenarios for multi-agent reinforcement learning. ☆735 Updated 3 weeks ago
- Cramming the training of a (BERT-type) language model into limited compute. ☆1,349 Updated last year
- [NeurIPS 2023] Generating Mario Levels with GPT2. Code for the paper "MarioGPT: Open-Ended Text2Level Generation through Large Language M… ☆1,136 Updated last year
- Code for Parsel 🐍 - generate complex programs with language models ☆432 Updated 2 years ago
- ☆2,788 Updated last year
- Code for "Learning to summarize from human feedback" ☆1,043 Updated 2 years ago
- ☆2,164 Updated last year