Collaborative Deep Reinforcement Learning
☆32 · Jul 29, 2017 · Updated 8 years ago
Alternatives and similar repositories for cdrl
Users that are interested in cdrl are comparing it to the libraries listed below.
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning ☆81 · Nov 22, 2017 · Updated 8 years ago
- reinforcement learning. policy gradient. PCL ☆37 · Apr 25, 2017 · Updated 8 years ago
- Robust policy search algorithms which train on model ensembles ☆30 · Oct 26, 2016 · Updated 9 years ago
- Multi-agent active perception with prediction rewards ☆11 · Nov 13, 2020 · Updated 5 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning. (C51-DQN) ☆57 · Aug 25, 2017 · Updated 8 years ago
- Asynchronous Advantage Actor Critic ☆20 · Aug 15, 2016 · Updated 9 years ago
- Conditional Random Fields implemented as Lasagne layer ☆10 · Jul 22, 2016 · Updated 9 years ago
- Lifelong Variational Autoencoder ☆15 · Dec 6, 2017 · Updated 8 years ago
- ☆27 · Dec 2, 2017 · Updated 8 years ago
- ☆15 · Sep 5, 2016 · Updated 9 years ago
- Paper list of multi-agent reinforcement learning (MARL) ☆43 · Oct 30, 2021 · Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆51 · Jan 16, 2019 · Updated 7 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction ☆80 · Jan 5, 2019 · Updated 7 years ago
- Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation ☆11 · Jul 26, 2016 · Updated 9 years ago
- A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow ☆15 · Apr 27, 2018 · Updated 7 years ago
- ☆14 · Sep 27, 2019 · Updated 6 years ago
- Model Zoo for Deep Reinforcement Learning ☆14 · Dec 19, 2015 · Updated 10 years ago
- Jointly learning policies and latent representations for driver behavior. ☆15 · Jun 6, 2017 · Updated 8 years ago
- ☆13 · Nov 17, 2015 · Updated 10 years ago
- Advantage async actor-critic Algorithms (A3C) and Progressive Neural Network implemented by tensorflow. ☆121 · Oct 12, 2016 · Updated 9 years ago
- Resilient Multi-Agent Reinforcement Learning ☆10 · Nov 4, 2022 · Updated 3 years ago
- Solving reinforcement learning tasks which require language and vision ☆33 · Apr 4, 2023 · Updated 2 years ago
- Convolutional Neural Networks with Recurrent Neural Filters ☆53 · Apr 15, 2019 · Updated 6 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO ☆31 · Jun 24, 2018 · Updated 7 years ago
- ☆13 · Apr 3, 2019 · Updated 6 years ago
- ☆35 · Jan 29, 2023 · Updated 3 years ago
- A Lightweight Multi-modality Image Segmentation Network via Domain Adaptation using Gradient Magnitude and Shape Constraint ☆10 · Apr 3, 2023 · Updated 2 years ago
- Repo containing to-dos and instructions for DRL in POMDPs.jl ☆11 · Jun 21, 2016 · Updated 9 years ago
- ☆15 · Oct 29, 2018 · Updated 7 years ago
- Task-oriented Dialog Policy Learning with Multi-Agent Reinforcement Learning ☆53 · Jun 23, 2020 · Updated 5 years ago
- Learning RNN Hierarchies ☆45 · Jun 22, 2016 · Updated 9 years ago
- RWA in pytorch ☆14 · May 7, 2017 · Updated 8 years ago
- Code for replicating results in 'On Weight Initializations in Deep Neural Networks' ☆10 · Apr 28, 2017 · Updated 8 years ago
- reinforcement learning toolbox, contains TRPO and A3C algorithms for continuous action space ☆42 · Jan 27, 2018 · Updated 8 years ago
- Tensorflow implementation of SGD with Coupled Adaptive Batch Size (CABS) ☆44 · Feb 28, 2017 · Updated 9 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters" ☆155 · Sep 22, 2017 · Updated 8 years ago
- ☆10 · Aug 9, 2018 · Updated 7 years ago
- Non-linear policy graph improvement - planning for Dec-POMDPs ☆16 · Mar 3, 2021 · Updated 5 years ago
- Exploration by Random Network Distillation ☆15 · Dec 30, 2018 · Updated 7 years ago