jihoonerd / Human-level-control-through-deep-reinforcement-learningLinks
đ Paper: Human-level control through deep reinforcement learning đšī¸
â51Updated last year
Alternatives and similar repositories for Human-level-control-through-deep-reinforcement-learning
Users that are interested in Human-level-control-through-deep-reinforcement-learning are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flowâ38Updated 11 months ago
- Experiments with reinforcement learning and recurrent neural networksâ115Updated last year
- PyTorch implementation of GAIL and AIRL based on PPO.â226Updated 4 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.â100Updated 5 months ago
- A simple implementation of Generative Adversarial Imitation Learning with PyTorchâ168Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on Discrete action spaceâ69Updated last year
- Proximal Policy Optimization (Continuous Version) in PyTorch.â29Updated 5 months ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.â69Updated 4 months ago
- Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.â29Updated 5 months ago
- A Reinforcement Learning Project using PPO + Transformerâ70Updated 2 years ago
- PyTorch implementation of SAC-Discrete.â311Updated last year
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.â359Updated 2 years ago
- Stable-Baselines3 (SB3) reinforcement learning tutorial for the Reinforcement Learning Virtual School 2021.â56Updated 2 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.â146Updated 3 years ago
- Pytorch Implementation of Policy Distillation for control, which has well-trained teachers via stable_baselines3.â59Updated 4 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.â172Updated 10 months ago
- A collection of pre-trained RL agents using Stable Baselines3â138Updated 11 months ago
- An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video gamesâ32Updated 2 years ago
- â14Updated 5 years ago
- PyTorch implementation of DDPG for continuous control tasks.â46Updated 5 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"â180Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic(SAC).â105Updated 5 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RLâ375Updated 3 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022â335Updated last year
- Prioritized Experience Replay implementation with proportional prioritizationâ84Updated 2 years ago
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning libraryâ237Updated 8 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.â52Updated 5 months ago
- Single-file pytorch implementation of hybrid-SACâ59Updated 4 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.â163Updated last year
- Author's PyTorch implementation of TD7 for online and offline RLâ149Updated 2 years ago