PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN))
☆14Mar 22, 2019Updated 7 years ago
Alternatives and similar repositories for pytorch-r2d2-DPG
Users that are interested in pytorch-r2d2-DPG are comparing it to the libraries listed below.
- An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch☆53Jul 19, 2022Updated 3 years ago
- Pytorch implementation of distributed deep reinforcement learning☆76Jul 4, 2022Updated 3 years ago
- PyTorch implementation of R2D2 (Recurrent Replay Distributed DQN)☆12Nov 14, 2019Updated 6 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 8 years ago
- Experiments with transformer based RL algorithms☆22Nov 23, 2019Updated 6 years ago
- A bipedal humanoid control system using a Physics-Informed Neural Network (PINN) and Reinforcement Learning (RL) for stability and manipu…☆13Mar 25, 2026Updated 3 weeks ago
- Multi-Agent training using Deep Deterministic Policy Gradient Networks, Solving the Tennis Environment☆11Oct 20, 2018Updated 7 years ago
- Revolution of CNN classification (LeNet, AlexNet, VGG, Inception, ResNet)☆14Dec 21, 2018Updated 7 years ago
- ICLR Reproducibility Challenge for Discriminator-Actor-Critic☆20Jan 7, 2019Updated 7 years ago
- ☆13Oct 1, 2017Updated 8 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 5 years ago
- A simple PyTorch implementation of Population Based Training of Neural Networks.☆64Mar 14, 2019Updated 7 years ago
- Lightweight multi-agent PPO for IEEE field.☆15Mar 23, 2022Updated 4 years ago
- Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations (ICLR 2020)☆27Oct 12, 2021Updated 4 years ago
- Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems☆32Oct 9, 2018Updated 7 years ago
- AI-powered cryptocurrency trading bot built using deep reinforcement learning (DRL). The bot is designed as a research platform for devel…☆10Jan 18, 2025Updated last year
- Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".☆28Feb 8, 2020Updated 6 years ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch☆27Feb 11, 2025Updated last year
- Mxnet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO☆28Dec 8, 2022Updated 3 years ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer☆33Updated this week
- Translation and understanding of the Pop-art paper.☆18Oct 21, 2019Updated 6 years ago
- Codes for the paper "SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning"☆10Jun 24, 2022Updated 3 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆30Mar 14, 2019Updated 7 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- Multi Type Mean Field Reinforcement Learning☆31Jun 13, 2022Updated 3 years ago
- Pytorch code for "Learning Guidance Rewards with Trajectory-space Smoothing" (NeurIPS 2020)☆12Jul 7, 2021Updated 4 years ago
- Simulating V2V and V2I connectivity in Matlab using car following, lane changing models and entry and exit ramps on a 4-Highways, 3-Ramps…☆22Jun 15, 2015Updated 10 years ago
- Codes for the paper "Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach"☆15Aug 30, 2024Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Patient data simulator following the structure of an open-ai gym.☆12Jul 9, 2019Updated 6 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- A PyTorch implementation of SVGD (Stein Variational Gradient Descent), contains all examples including bayesian inference in the paper☆12Jul 30, 2020Updated 5 years ago
- Deep Transfer Learning codes using Google TensorFlow☆13Apr 4, 2016Updated 10 years ago
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- Proof of concept of the SayCan project applying on real UR5 robot☆10May 15, 2023Updated 2 years ago
- ☆30Sep 3, 2019Updated 6 years ago
- Notes and code from studying reinforcement learning☆12Jul 27, 2020Updated 5 years ago