jinbeizame007/pytorch-r2d2-DPG

PyTorch implementation of R2D2 (Recurrent Replay Distributed DPG (not DQN))

☆14

Alternatives and similar repositories for pytorch-r2d2-DPG

Users who are interested in pytorch-r2d2-DPG are comparing it to the repositories listed below.

neka-nat / distributed_rl
Pytorch implementation of distributed deep reinforcement learning
☆76 · Jul 4, 2022 · Updated 3 years ago
ZiyuanMa / R2D2
An Implementation of Recurrent Experience Replay in Distributed Reinforcement Learning (Kapturowski et al. 2019) in PyTorch
☆53 · Jul 19, 2022 · Updated 3 years ago
udion / Transformer-RL
Experiments with transformer based RL algorithms
☆22 · Nov 23, 2019 · Updated 6 years ago
BY571 / D4PG
PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…
☆24 · Apr 7, 2021 · Updated 4 years ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆30 · Mar 14, 2019 · Updated 6 years ago
deep-skill-chaining / deep-skill-chaining
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆30 · Sep 24, 2019 · Updated 6 years ago
baimingc / delay-aware-MBRL
Codes for Paper "Delay-Aware Model-Based Reinforcement Learning for Continuous Control".
☆28 · Feb 8, 2020 · Updated 6 years ago
nnaisense / MAGE
Learning Action-Value Gradients in Model-based Policy Optimization
☆32 · Sep 7, 2021 · Updated 4 years ago
avisingh599 / cog
[CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning
☆34 · Oct 28, 2020 · Updated 5 years ago
DylanSiegel / sheeplz-crypto-bot
AI-powered cryptocurrency trading bot built using deep reinforcement learning (DRL). The bot is designed as a research platform for devel…
☆10 · Jan 18, 2025 · Updated last year
rllab-snu / mog_dqn_car_racing
☆30 · Sep 3, 2019 · Updated 6 years ago
BorealisAI / mtmfrl
Multi Type Mean Field Reinforcement Learning
☆31 · Jun 13, 2022 · Updated 3 years ago
tjuHaoXiaotian / GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
☆32 · Oct 9, 2018 · Updated 7 years ago
ertsiger / induction-subgoal-automata-rl
Code for the papers "Induction of Subgoal Automata for Reinforcement Learning" (AAAI-20) and "Induction and Exploitation of Subgoal Autom…
☆14 · Aug 15, 2023 · Updated 2 years ago
RoozbehRazavi / BIMRL
Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)
☆10 · Dec 1, 2022 · Updated 3 years ago
llan-ml / tesp
Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"
☆35 · May 17, 2019 · Updated 6 years ago
flowersteam / geppg
☆36 · Aug 10, 2018 · Updated 7 years ago
LJY-XCX / RFTrans
☆12 · Nov 5, 2023 · Updated 2 years ago
SuReLI / llrl
Lipschitz Lifelong RL
☆11 · Nov 6, 2020 · Updated 5 years ago
ShilinC / 3D-ML-Reference
☆11 · Feb 2, 2018 · Updated 8 years ago
zhougroup / IDAC
Implicit Distributional Actor Critic
☆11 · Dec 8, 2021 · Updated 4 years ago
jpmattern / seir-covid19
☆12 · Oct 11, 2022 · Updated 3 years ago
HermiTech-L3C / Morty
A bipedal humanoid control system using a Physics-Informed Neural Network (PINN) and Reinforcement Learning (RL) for stability and manipu…
☆10 · Aug 15, 2024 · Updated last year
binz98 / Multi_Agent_Stackelberg_Decision_Transformer
Codes for the paper "Sequential Asynchronous Action Coordination in Multi-Agent Systems: A Stackelberg Decision Transformer Approach"
☆15 · Aug 30, 2024 · Updated last year
aaronwalsman / ltron-torch-eccv22
☆10 · Jul 29, 2022 · Updated 3 years ago
hbanzhaf / docker_covise
☆10 · Sep 21, 2021 · Updated 4 years ago
Riften / Paper-Reading
Notes for paper reading.
☆10 · Mar 2, 2026 · Updated last week
jjhw / SayCan_experimental
Proof of concept of the SayCan project applying on real UR5 robot
☆10 · May 15, 2023 · Updated 2 years ago
markus-suchi / 3D-DAT
3D Scene Annotation and Dataset Toolkit
☆10 · Jun 11, 2023 · Updated 2 years ago
usman15997 / RL-controlled-Lights-and-I2V-SUMO
This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…
☆10 · Oct 12, 2020 · Updated 5 years ago
bhaktipriya / Atari
1-step Q Learning from the paper "Asynchronous Methods for Deep Reinforcement Learning"
☆12 · Mar 13, 2017 · Updated 8 years ago
anandsaha / rl.capstone
My Udacity Machine Learning Nanodegree capstone project in Reinforcement Learning
☆10 · Dec 1, 2017 · Updated 8 years ago
strongio / dosing-rl-gym
Patient data simulator following the structure of an open-ai gym.
☆11 · Jul 9, 2019 · Updated 6 years ago
shwangtangjun / SVGD-PyTorch
A PyTorch implementation of SVGD (Stein Variational Gradient Descent), contains all examples including bayesian inference in the paper
☆12 · Jul 30, 2020 · Updated 5 years ago
david-abel / state_abstraction
Code for abstracting, evaluating, and visualizing Markov Decision Processes.
☆10 · Jan 12, 2017 · Updated 9 years ago
mossr / CrossEntropyVariants.jl
Cross-entropy method variants for optimization in Julia
☆12 · Apr 29, 2021 · Updated 4 years ago
JuliaPOMDP / POMCP.jl
Julia Implementation of the POMCP algorithm for solving POMDPs
☆12 · Aug 6, 2021 · Updated 4 years ago
root-master / unified-hrl
Unified Model-Free Hierarchical Reinforcement Learning Framework
☆39 · Mar 8, 2019 · Updated 7 years ago
qiaochen / DDPG_MultiAgent
Multi-Agent training using Deep Deterministic Policy Gradient Networks, Solving the Tennis Environment
☆11 · Oct 20, 2018 · Updated 7 years ago