tjwhitaker / human-level-control-through-deep-reinforcement-learning
Deep Q Networks
☆88 Updated 7 years ago
Alternatives and similar repositories for human-level-control-through-deep-reinforcement-learning
Users who are interested in human-level-control-through-deep-reinforcement-learning are comparing it to the libraries listed below
- Various ways to teach a computer to escape from a maze. From random walk to a simple neural network. ☆103 Updated 3 years ago
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,… ☆121 Updated 4 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020 ☆32 Updated 4 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023). Football game agent. ☆61 Updated 2 years ago
- Experiments with reinforcement learning and recurrent neural networks ☆115 Updated last year
- 📖 Paper: Deep Reinforcement Learning with Double Q-learning 🕹️ ☆56 Updated last year
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch. ☆69 Updated 4 months ago
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning library ☆238 Updated 8 months ago
- A PyTorch implementation of DeepMind's MuZero agent ☆36 Updated last year
- 📖 Paper: Human-level control through deep reinforcement learning 🕹️ ☆51 Updated last year
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆52 Updated 5 months ago
- Online Decision Transformer ☆270 Updated last year
- Lightweight multi-agent gridworld Gym environment ☆210 Updated 2 years ago
- Code for "Proximal Distilled Evolutionary Reinforcement Learning", accepted at AAAI 2020 ☆55 Updated last year
- Proximal Policy Optimization (Continuous Version) in PyTorch. ☆29 Updated 5 months ago
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations. ☆94 Updated 2 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory ☆192 Updated last year
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli… ☆135 Updated 4 years ago
- ☆184 Updated 3 years ago
- ☆33 Updated 4 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch ☆105 Updated 3 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning" ☆181 Updated 2 years ago
- Proximal policy optimization in PyTorch. Easy to read and understand. ☆51 Updated 4 years ago
- TensorFlow implementation of the asynchronous advantage actor-critic (A3C) reinforcement learning algorithm for continuous action space ☆46 Updated 8 years ago
- Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted) ☆164 Updated 2 years ago
- A clean and robust PyTorch implementation of PPO on Discrete action space ☆69 Updated last year
- Curiosity-driven Exploration by Self-supervised Prediction ☆141 Updated 2 years ago
- On-Policy Policy Gradient Algorithms in JAX ☆40 Updated last year
- Official code repo for the MARL book (www.marl-book.com) ☆555 Updated 6 months ago
- Transformers are Meta-Reinforcement Learners - International Conference on Machine Learning (ICML) 2022 ☆61 Updated 2 years ago