facebookresearch / diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
☆1,342Updated last month
Alternatives and similar repositories for diplomacy_cicero
Users that are interested in diplomacy_cicero are comparing it to the libraries listed below
Sorting:
- ☆532Updated last year
- Monte Carlo tree search in JAX☆2,479Updated last month
- Evolution Through Large Models☆718Updated last year
- ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration ca…☆1,459Updated 11 months ago
- Dromedary: towards helpful, ethical and reliable LLMs.☆1,145Updated last week
- Cramming the training of a (BERT-type) language model into limited compute.☆1,331Updated 11 months ago
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆839Updated 7 months ago
- Ask Me Anything language model prompting☆548Updated last year
- A prize for finding tasks that cause large language models to show inverse scaling☆611Updated last year
- ☆1,042Updated 2 years ago
- Building Open-Ended Embodied Agents with Internet-Scale Knowledge☆1,959Updated last year
- Model API for GALACTICA☆2,723Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆821Updated 2 years ago
- Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments☆1,073Updated last year
- ☆710Updated last year
- Code for Parsel 🐍 - generate complex programs with language models☆430Updated last year
- Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos☆1,450Updated 11 months ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.☆763Updated 6 months ago
- A library for generative social simulation☆870Updated last week
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,643Updated last year
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI☆2,032Updated 9 months ago
- Quantized inference code for LLaMA models☆1,048Updated 2 years ago
- A suite of test scenarios for multi-agent reinforcement learning.☆697Updated last week
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …☆632Updated last year
- This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (Neur…☆531Updated 3 months ago
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆532Updated 8 months ago
- Used for adaptive human in the loop evaluation of language and embedding models.☆309Updated 2 years ago
- Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".☆1,129Updated last year
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,742Updated last year
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆491Updated 3 months ago