prasoongoyal / rl-learn
Using Natural Language for Reward Shaping in Reinforcement Learning
☆23Updated 11 months ago
Related projects
Alternatives and complementary repositories for rl-learn
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆36Updated 4 years ago
- BabyAI++: Towards Grounded Language Learning beyond Memorization, ICLR BeTR-RL 2020☆25Updated 4 years ago
- PyTorch Implementation of "Language as an Abstraction for Hierarchical Deep Reinforcement Learning" paper☆23Updated 2 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆19Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆31Updated 4 years ago
- On the pitfalls of measuring emergent communication☆34Updated 5 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Updated 3 years ago
- ☆41Updated 6 years ago
- Implementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization (PPO) algorithms that takes advantage of TensorFlow 2.x.☆9Updated 4 years ago
- Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)☆19Updated 4 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆23Updated 5 years ago
- Reinforcement Learning papers on exploration methods.☆20Updated 3 years ago
- Solving reinforcement learning tasks which require language and vision☆32Updated last year
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Updated 6 years ago
- Diversity-Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆48Updated 5 years ago
- Implements the Messenger environment and EMMA model.☆23Updated last year
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆10Updated 6 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆101Updated last year
- Change-Based Exploration Transfer☆36Updated 2 years ago
- ☆37Updated 2 years ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 4 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆15Updated 2 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆15Updated 3 months ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Updated 4 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architecture☆18Updated 5 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago
- Repository for our ICML 2019 paper: Curiosity-Bottleneck☆33Updated last year
- A simple and easy to use implementation of the soft actor-critic algorithm.☆15Updated 2 years ago
- Generalised UDRL☆37Updated 2 years ago
- cjlovering / Towards-Interpretable-Reinforcement-Learning-Using-Attention-Augmented-Agents-Replication☆22Updated 5 years ago