jihoonerd / Human-level-control-through-deep-reinforcement-learning
📖 Paper: Human-level control through deep reinforcement learning 🕹️
☆37Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for Human-level-control-through-deep-reinforcement-learning
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆296Updated 2 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆161Updated 2 months ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆93Updated 2 years ago
- Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.☆89Updated last year
- Lightweight multi-agent gridworld Gym environment☆198Updated last year
- PyTorch implementation of GAIL and AIRL based on PPO.☆198Updated 4 years ago
- Pytorch Implementation of Policy Distillation for control, which has well-trained teachers via stable_baselines3.☆54Updated 3 years ago
- A simple implementation of Generative Adversarial Imitation Learning with PyTorch☆135Updated 2 years ago
- PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.☆140Updated 3 years ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆163Updated 2 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆331Updated last year
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning library☆208Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆328Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆152Updated last week
- A clean and robust Pytorch implementation of PPO on Discrete action space☆59Updated 5 months ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆132Updated 3 months ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆146Updated 7 months ago
- PyTorch implementation of DDPG for continuous control tasks.☆44Updated 4 years ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated last year
- Code for "Temporal Difference Learning for Model Predictive Control"☆362Updated 11 months ago
- Experiments with reinforcement learning and recurrent neural networks☆113Updated last year
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆271Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆226Updated 4 years ago
- (NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation☆204Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC)☆515Updated 2 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆176Updated 2 months ago
- PyTorch implementation of SAC-Discrete.☆284Updated 3 months ago
- PyTorch implementation of Soft Actor-Critic(SAC).☆98Updated 4 years ago
- Stable-Baselines3 (SB3) reinforcement learning tutorial for the Reinforcement Learning Virtual School 2021.☆50Updated last year
- Implementation of Trajectory Transformer with attention caching and batched beam search☆107Updated last year