jihoonerd / Human-level-control-through-deep-reinforcement-learningLinks

📖 Paper: Human-level control through deep reinforcement learning 🕹️

☆51

Alternatives and similar repositories for Human-level-control-through-deep-reinforcement-learning

Users that are interested in Human-level-control-through-deep-reinforcement-learning are comparing it to the libraries listed below

Sorting:

ChienFeng-hub / meow
[NeurIPS 2024] Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow
☆37Updated 8 months ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆114Updated last year
alirezakazemipour / DDPG-HER
Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.
☆99Updated 2 months ago
araffin / rl-handson-rlvs21
Stable-Baselines3 (SB3) reinforcement learning tutorial for the Reinforcement Learning Virtual School 2021.
☆54Updated 2 years ago
Improbable-AI / eipo
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
☆82Updated 2 years ago
compsciencelab / ppo_D
This is the official repository for the paper "Guided Exploration with Proximal Policy Optimization using a Single Demonstration", https:…
☆19Updated 3 years ago
Howuhh / prioritized_experience_replay
Prioritized Experience Replay implementation with proportional prioritization
☆81Updated last year
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆170Updated 8 months ago
CUN-bjy / policy-distillation-baselines
Pytorch Implementation of Policy Distillation for control, which has well-trained teachers via stable_baselines3.
☆59Updated 4 years ago
toshikwa / gail-airl-ppo.pytorch
PyTorch implementation of GAIL and AIRL based on PPO.
☆223Updated 4 years ago
alirezakazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆51Updated 2 months ago
fschur / DDQN-with-PyTorch-for-OpenAI-Gym
Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.
☆69Updated last month
jakegrigsby / deep_control
Deep Reinforcement Learning for Continuous Control in PyTorch
☆100Updated 3 years ago
hcnoh / gail-pytorch
A simple implementation of Generative Adversarial Imitation Learning with PyTorch
☆162Updated 3 years ago
CherryPieSexy / imitation_learning
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
☆145Updated 3 years ago
BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆89Updated 2 years ago
shehryar-malik / icrl
Inverse Constrained Reinforcement Learning (ICML 2021)
☆24Updated 3 years ago
alirezakazemipour / SAC
Implementation of Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor.
☆28Updated 2 months ago
montaserFath / BCO
behavior cloning from observation
☆35Updated 4 years ago
DLR-RM / rl-trained-agents
A collection of pre-trained RL agents using Stable Baselines3
☆130Updated 8 months ago
XinJingHao / PPO-Discrete-Pytorch
A clean and robust Pytorch implementation of PPO on Discrete action space
☆70Updated last year
BY571 / CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…
☆137Updated last year
qingshi9974 / PPO-pytorch-Mujoco
Implement PPO algorithm on mujoco environment，such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.
☆53Updated 5 years ago
vincent-thevenin / DreamerV2-Pytorch
Pytorch implementation of DreamerV2: MASTERING ATARI WITH DISCRETE WORLD MODELS
☆50Updated 3 years ago
datvodinh / ppo-transformer
A Reinforcement Learning Project using PPO + Transformer
☆59Updated last year
Farama-Foundation / gym-examples
Example code for the Gym documentation
☆72Updated 2 years ago
schneimo / ddpg-pytorch
PyTorch implementation of DDPG for continuous control tasks.
☆46Updated 5 years ago
yashbonde / Transformer-RL
Experiments to train transformer network to master reinforcement learning environments.
☆32Updated 4 years ago
seolhokim / Mujoco-Pytorch
PPO, DDPG, SAC implementation on mujoco environment
☆112Updated 3 years ago
MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆183Updated last year