ChangyWen / wolpertinger_ddpgView external linksLinks
Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatible.
☆66Dec 7, 2022Updated 3 years ago
Alternatives and similar repositories for wolpertinger_ddpg
Users that are interested in wolpertinger_ddpg are comparing it to the libraries listed below
Sorting:
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆178Mar 1, 2018Updated 7 years ago
- python implementation of the TPGR☆40Mar 27, 2019Updated 6 years ago
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- ☆11Feb 22, 2019Updated 6 years ago
- Implementation for ACER in tensorflow and sonnet by deepmind☆11Aug 28, 2017Updated 8 years ago
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆121Feb 3, 2023Updated 3 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆11May 29, 2021Updated 4 years ago
- learning to play atari games with reinforcement learning☆10Jan 4, 2016Updated 10 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29May 12, 2025Updated 9 months ago
- Examples for variational inference☆16Jan 28, 2015Updated 11 years ago
- Explore the potential of recommendation system using reinforcement learning☆15Apr 23, 2020Updated 5 years ago
- This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit☆20Sep 10, 2016Updated 9 years ago
- Round 1 Starter Kit for the MarLo challenge☆21Sep 27, 2018Updated 7 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆19Jun 15, 2018Updated 7 years ago
- ☆19Mar 5, 2018Updated 7 years ago
- Keras implementation of DQN for the MsPacman-v0 OpenAI Gym environment.☆37Dec 8, 2022Updated 3 years ago
- ☆18Apr 17, 2019Updated 6 years ago
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago
- Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch☆629Aug 13, 2018Updated 7 years ago
- A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation.☆21Nov 29, 2020Updated 5 years ago
- AI for the game Uno☆17Aug 6, 2019Updated 6 years ago
- Customisable Unified Physical Simulations (CUPS) for Reinforcement Learning. Experiments run on the ai2thor environment (http://ai2thor.a…☆51Mar 9, 2020Updated 5 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆88Dec 8, 2022Updated 3 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆54Feb 25, 2025Updated 11 months ago
- Implementation of Scheduled Policy Optimization for task-oriented language grouding☆29Jul 16, 2018Updated 7 years ago
- Research repo of RL☆23Mar 25, 2023Updated 2 years ago
- Population Based Training, Figure 2☆25Dec 2, 2017Updated 8 years ago
- BranchingDQN☆51Jan 30, 2019Updated 7 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆96Mar 1, 2021Updated 4 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- PyTorch implementation of SAC-Discrete.☆314Jul 25, 2024Updated last year
- Pytorch implementation of Soft Actor-Critic☆20Apr 13, 2020Updated 5 years ago
- Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite☆25Oct 11, 2018Updated 7 years ago
- OpenAI Gym Wrapper for DeepMind Control Suite☆74Nov 30, 2021Updated 4 years ago
- Process Simulations Meet AI. Supercharge Your Process Engineering. Generate Infinite Data, Train Advanced Models, and Revolutionise Indus…☆11Oct 8, 2024Updated last year
- TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x☆62Apr 5, 2021Updated 4 years ago
- Deep reinforcement learning for recommendation system☆186Jul 1, 2019Updated 6 years ago
- Revisiting Rainbow☆75Jun 9, 2021Updated 4 years ago