tjwhitaker / human-level-control-through-deep-reinforcement-learningLinks
Deep Q Networks
☆91Updated 7 years ago
Alternatives and similar repositories for human-level-control-through-deep-reinforcement-learning
Users that are interested in human-level-control-through-deep-reinforcement-learning are comparing it to the libraries listed below
Sorting:
- DQN-Atari-Agents: Modularized & Parallel PyTorch implementation of several DQN Agents, i.a. DDQN, Dueling DQN, Noisy DQN, C51, Rainbow,…☆121Updated 4 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆30Updated 7 months ago
- Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.☆214Updated 2 years ago
- ☆186Updated 3 years ago
- 📖 Paper: Deep Reinforcement Learning with Double Q-learning 🕹️☆58Updated last year
- Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotli…☆137Updated 4 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Updated 4 years ago
- PyTorch implementation of SAC-Discrete.☆312Updated last year
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆183Updated 2 years ago
- Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)☆230Updated 3 years ago
- A clean and robust Pytorch implementation of PPO on Discrete action space☆72Updated last year
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆52Updated 7 months ago
- Implementation of Double DQN reinforcement learning for OpenAI Gym environments with PyTorch.☆70Updated 6 months ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆356Updated 5 years ago
- Online Decision Transformer☆274Updated last year
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆85Updated 3 years ago
- very easy implementation of dueling DQN in pytorch☆74Updated 3 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆198Updated last year
- Prioritized Experience Replay implementation with proportional prioritization☆85Updated 2 years ago
- Applying minimaxQ learning algorithm to 2 agents games☆33Updated 8 years ago
- A clean and robust Pytorch implementation of PPO on continuous action space.☆168Updated last year
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning library☆243Updated 10 months ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆338Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆95Updated 2 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆145Updated 2 years ago
- A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"☆130Updated 2 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆90Updated 2 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆198Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC)☆574Updated 4 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆363Updated 2 years ago