PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN))
☆14Mar 22, 2019Updated 7 years ago
Alternatives and similar repositories for pytorch-r2d2-DPG
Users that are interested in pytorch-r2d2-DPG are comparing it to the libraries listed below.
- An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch☆53Jul 19, 2022Updated 3 years ago
- Pytorch implementation of distributed deep reinforcement learning☆76Jul 4, 2022Updated 3 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DQN)☆12Nov 14, 2019Updated 6 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- Experiments with transformer based RL algorithms☆22Nov 23, 2019Updated 6 years ago
- A bipedal humanoid control system using a Physics-Informed Neural Network (PINN) and Reinforcement Learning (RL) for stability and manipu…☆12Updated this week
- Multi-Agent training using Deep Deterministic Policy Gradient Networks, Solving the Tennis Environment☆11Oct 20, 2018Updated 7 years ago
- PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies☆58Jan 22, 2021Updated 5 years ago
- Revolution of CNN classification (LeNet, AlexNet, VGG, Inception, ResNet)☆14Dec 21, 2018Updated 7 years ago
- Chinese-language notes for UCB CS294-112 Deep Reinforcement Learning☆51Jan 2, 2021Updated 5 years ago
- Collection of reinforcement learning algorithms☆16Sep 29, 2025Updated 6 months ago
- ICLR Reproducibility Challenge for Discriminator-Actor-Critic☆20Jan 7, 2019Updated 7 years ago
- SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's arch…☆836Nov 29, 2022Updated 3 years ago
- A simple PyTorch implementation of Population Based Training of Neural Networks.☆64Mar 14, 2019Updated 7 years ago
- Lightweight multi-agent PPO for IEEE field.☆15Mar 23, 2022Updated 4 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆32Oct 9, 2018Updated 7 years ago
- Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆28Feb 8, 2020Updated 6 years ago
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch☆27Feb 11, 2025Updated last year
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago
- Translation and understanding of the Pop-art paper.☆17Oct 21, 2019Updated 6 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆30Mar 14, 2019Updated 7 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- Multi Type Mean Field Reinforcement Learning☆31Jun 13, 2022Updated 3 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- Simulating V2V and V2I connectivity in Matlab using car following, lane changing models and entry and exit ramps on a 4-Highways, 3-Ramps…☆21Jun 15, 2015Updated 10 years ago
- Codes for the paper "Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach"☆15Aug 30, 2024Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Patient data simulator following the structure of an open-ai gym.☆12Jul 9, 2019Updated 6 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- A PyTorch implementation of SVGD (Stein Variational Gradient Descent), contains all examples including bayesian inference in the paper☆12Jul 30, 2020Updated 5 years ago
- Deep Transfer Learning codes using Google TensorFlow☆13Apr 4, 2016Updated 9 years ago
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- Proof of concept of the SayCan project applying on real UR5 robot☆10May 15, 2023Updated 2 years ago
- ☆30Sep 3, 2019Updated 6 years ago
- ☆45Feb 12, 2021Updated 5 years ago
- Notes and code from the process of learning reinforcement learning☆12Jul 27, 2020Updated 5 years ago