mnoukhov / emergent-compete
Code for Emergent Communication under Competition (AAMAS 2021)
☆10Updated last year
Related projects
Alternatives and complementary repositories for emergent-compete
- On the pitfalls of measuring emergent communication☆34Updated 5 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆16Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- Reinforcement Learning papers on exploration methods.☆20Updated 3 years ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Updated 6 years ago
- Reproduce ICLR2018 submission "Emergent Communication through Negotiation"☆17Updated 6 years ago
- BabyAI++: Towards Grounded Language Learning beyond Memorization, ICLR BeTR-RL 2020☆25Updated 4 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆43Updated 3 years ago
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies☆18Updated 4 years ago
- Reinforcement Learning with Latent Flow☆43Updated 3 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Updated last year
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading".☆38Updated 3 years ago
- TensorFlow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆43Updated last year
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆33Updated 4 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆65Updated 3 years ago
- Contextual Bandits Action Elimination DQN☆19Updated 6 years ago
- Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)☆74Updated 4 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆53Updated 4 years ago
- Invariant Causal Prediction for Block MDPs☆43Updated 4 years ago
- Systematic generalization test for CLEVR☆15Updated 4 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 6 years ago
- RL environment replicating the werewolf game to study emergent communication☆18Updated last year
- Variational Reinforcement Learning☆16Updated 3 months ago