michaelnny / deep_rl_zooLinks

A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

☆121

Alternatives and similar repositories for deep_rl_zoo

Users that are interested in deep_rl_zoo are comparing it to the libraries listed below

Sorting:

semitable / lb-foraging
Level-based Foraging (LBF): A multi-agent environment for RL
☆199Updated last year
nikhilbarhate99 / min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open…
☆284Updated 3 years ago
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆186Updated last year
Farama-Foundation / MO-Gymnasium
Multi-objective Gymnasium environments for reinforcement learning
☆357Updated this week
toshikwa / sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
☆313Updated last year
instadeepai / og-marl
Datasets with baselines for Offline MARL.
☆193Updated last month
MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆200Updated last year
Howuhh / prioritized_experience_replay
Prioritized Experience Replay implementation with proportional prioritization
☆85Updated 2 years ago
dhruvramani / Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
☆183Updated 2 years ago
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆330Updated 2 years ago
BY571 / DQN-Atari-Agents
DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…
☆121Updated 5 years ago
Kchu / DeepRL_PyTorch
Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.
☆214Updated 2 years ago
ArnaudFickinger / gym-multigrid
Lightweight multi-agent gridworld Gym environment
☆212Updated 2 years ago
kevslinger / DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆171Updated last year
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆140Updated last year
openai / phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"
☆267Updated 2 years ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆332Updated 4 years ago
DLR-RM / rl-trained-agents
A collection of pre-trained RL agents using Stable Baselines3
☆141Updated last year
MarcoMeter / recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
☆156Updated last year
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆148Updated 6 years ago
aviralkumar2907 / CQL
Code for conservative Q-learning
☆467Updated 4 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆137Updated 3 months ago
lcswillems / torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
☆205Updated 3 years ago
openrlbenchmark / openrlbenchmark
☆246Updated last year
twni2016 / pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
☆340Updated last year
proroklab / popgym
Partially Observable Process Gym
☆209Updated 6 months ago
sfujim / TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆381Updated 4 years ago
oxwhirl / smacv2
☆281Updated last year
schroederdewitt / multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
☆363Updated 2 years ago
Xingyu-Lin / mbpo_pytorch
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
☆182Updated 3 years ago