facebookresearch / diplomacy_ciceroLinks
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
☆1,368Updated 3 months ago
Alternatives and similar repositories for diplomacy_cicero
Users that are interested in diplomacy_cicero are comparing it to the libraries listed below
Sorting:
- ☆1,229Updated 2 years ago
- ☆1,025Updated last year
- Monte Carlo tree search in JAX☆2,507Updated 3 months ago
- ☆539Updated last year
- Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.☆839Updated 9 months ago
- Evolution Through Large Models☆726Updated last year
- ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration ca…☆1,484Updated last year
- Code for Parsel 🐍 - generate complex programs with language models☆431Updated last year
- Code for "Learning to summarize from human feedback"☆1,032Updated last year
- Cramming the training of a (BERT-type) language model into limited compute.☆1,338Updated last year
- The hub for EleutherAI's work on interpretability and learning dynamics☆2,565Updated last month
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆823Updated 2 years ago
- Dromedary: towards helpful, ethical and reliable LLMs.☆1,148Updated 2 months ago
- ☆714Updated 2 years ago
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.☆765Updated 8 months ago
- A prize for finding tasks that cause large language models to show inverse scaling☆613Updated last year
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,674Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,482Updated 11 months ago
- Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos☆1,474Updated last year
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"☆1,062Updated last year
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,344Updated last year
- ☆2,843Updated last month
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆869Updated last year
- Ask Me Anything language model prompting☆547Updated 2 years ago
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆496Updated 5 months ago
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …☆634Updated 2 years ago
- A suite of test scenarios for multi-agent reinforcement learning.☆720Updated last week
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆816Updated last year
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,762Updated last month
- Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments☆1,085Updated last year