jinbeizame007 / pytorch-r2d2-DPGView external linksLinks
PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN))
☆14Mar 22, 2019Updated 6 years ago
Alternatives and similar repositories for pytorch-r2d2-DPG
Users that are interested in pytorch-r2d2-DPG are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of distributed deep reinforcement learning☆76Jul 4, 2022Updated 3 years ago
- An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch☆54Jul 19, 2022Updated 3 years ago
- ICLR Reproducibility Challenge for Discriminator-Actor-Critic☆20Jan 7, 2019Updated 7 years ago
- Experiments with transformer based RL algorithms☆22Nov 23, 2019Updated 6 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 4 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆30Mar 14, 2019Updated 6 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆28Feb 8, 2020Updated 6 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning☆34Oct 28, 2020Updated 5 years ago
- AI-powered cryptocurrency trading bot built using deep reinforcement learning (DRL). The bot is designed as a research platform for devel…☆10Jan 18, 2025Updated last year
- ☆30Sep 3, 2019Updated 6 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆35May 17, 2019Updated 6 years ago
- The Tensorflow code and a DeepMind Lab wrapper for my article "Meta-Reinforcement Learning" on FloydHub.☆37Mar 28, 2019Updated 6 years ago
- Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom…☆13Aug 15, 2023Updated 2 years ago
- ☆36Aug 10, 2018Updated 7 years ago
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Julia Implementation of the POMCP algorithm for solving POMDPs☆12Aug 6, 2021Updated 4 years ago
- 3D Scene Annotation and Dataset Toolkit☆10Jun 11, 2023Updated 2 years ago
- Notes for paper reading.☆10Dec 27, 2025Updated last month
- My Udacity Machine Learning Nanodegree capstone project in Reinforcement Learning☆10Dec 1, 2017Updated 8 years ago
- Lipschitz Lifelong RL☆11Nov 6, 2020Updated 5 years ago
- ☆12Nov 5, 2023Updated 2 years ago
- Patient data simulator following the structure of an open-ai gym.☆11Jul 9, 2019Updated 6 years ago
- Udacity Self Driving Car Nanodegree - Vehicle Detection☆10Oct 30, 2018Updated 7 years ago
- A bipedal humanoid control system using a Physics-Informed Neural Network (PINN) and Reinforcement Learning (RL) for stability and manipu…☆10Aug 15, 2024Updated last year
- Codes for the paper "Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach"☆15Aug 30, 2024Updated last year
- 1-step Q Learning from the paper "Asynchronous Methods for Deep Reinforcement Learning"☆12Mar 13, 2017Updated 8 years ago
- ☆10Sep 21, 2021Updated 4 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- Cross-entropy method variants for optimization in Julia☆12Apr 29, 2021Updated 4 years ago
- ☆11Feb 2, 2018Updated 8 years ago
- A PyTorch implementation of SVGD (Stein Variational Gradient Descent), contains all examples including bayesian inference in the paper☆12Jul 30, 2020Updated 5 years ago
- Unified Model-Free Hierarchical Reinforcement Learning Framework☆39Mar 8, 2019Updated 6 years ago
- A javascript visualization of the math genealogy database☆10Oct 27, 2023Updated 2 years ago
- Mongoose OS Captive Portal Library☆13Mar 1, 2022Updated 3 years ago
- Some PyTorch code for the Kaggle Speech Recognition Challenge☆12Feb 7, 2019Updated 7 years ago