ravi-lanka-4 / CoPiEr
Co-training for Policy Learning
☆13Updated 5 years ago
Alternatives and similar repositories for CoPiEr:
Users that are interested in CoPiEr are comparing it to the libraries listed below
- ICRL 2020☆19Updated 4 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- The implementation of Discriminator Soft Actor Critic☆14Updated 5 years ago
- ☆19Updated 3 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆55Updated 5 years ago
- This is the source code for solving the Traveling Salesman Problems (TSP) using Monte Carlo tree search (MCTS).☆30Updated 5 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Updated 5 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆33Updated 4 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆21Updated 3 years ago
- PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning☆49Updated 3 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆55Updated 5 years ago
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Updated 5 years ago
- A simple and easy to use implementation of the soft actor-critic algorithm.☆15Updated 2 years ago
- 🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation☆36Updated 2 years ago
- Creating fixed-length vectors to describe RL/GA policies☆20Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)☆33Updated 5 years ago
- Source code for "Multi-objective Model-based Policy Search for Data-efficient Learning with Sparse Rewards" (CoRL 2018)☆13Updated 6 years ago
- ☆35Updated 6 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆22Updated 9 months ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 5 years ago
- Autoregressive policies for continuous control reinforcement learning☆28Updated 5 years ago
- Avenue is a simulator designed to test and prototype reinforcement learning algorithms. Avenue is a ServiceNow Research project that was …☆15Updated 2 years ago
- Dead-ends and Secure Exploration in Reinforcement Learning☆11Updated 5 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆43Updated 3 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Updated 4 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆18Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago