Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
☆178Mar 1, 2018Updated 8 years ago
Alternatives and similar repositories for Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Users that are interested in Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…☆66Dec 7, 2022Updated 3 years ago
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…☆70Nov 28, 2019Updated 6 years ago
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…☆276Mar 22, 2018Updated 8 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆19Jun 15, 2018Updated 7 years ago
- Tensorflow implementation for "Generative Adversarial User Model forReinforcement Learning Based Recommendation System"☆131Sep 10, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- Implementation to VirtualTaobao☆13Jan 17, 2020Updated 6 years ago
- ☆11Feb 22, 2019Updated 7 years ago
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆121Feb 3, 2023Updated 3 years ago
- python implementation of the TPGR☆40Mar 27, 2019Updated 7 years ago
- Implementation for our paper in NeurIPS 2019☆48Dec 18, 2019Updated 6 years ago
- The Easiest Pytorch Implementation of Branching-DQN☆12Feb 10, 2021Updated 5 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- The code to reproduce the experimental results for "A Text-based Deep Reinforcement Learning Framework for Interactive Recommendation".☆12Mar 18, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Deep recommendation system☆13Dec 28, 2016Updated 9 years ago
- 5G assisted software defined vehicular network for cooperative data sharing☆11Oct 8, 2021Updated 4 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆23Jan 10, 2019Updated 7 years ago
- A Configurable Recommender Systems Simulation Platform☆783Jan 3, 2022Updated 4 years ago
- A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm☆27Feb 7, 2022Updated 4 years ago
- Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function,…☆64Apr 27, 2017Updated 8 years ago
- ☆54Jul 28, 2019Updated 6 years ago
- Virtual-Taobao simulators with OpenAI Gym interface☆532Nov 18, 2019Updated 6 years ago
- Repository for codes of 'Deep Reinforcement Learning'☆218Oct 4, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,321Sep 25, 2019Updated 6 years ago
- Spring 2017 Deep Reinforcement Learning Final Project☆30May 13, 2017Updated 8 years ago
- Inverse Reinforcement Learning Argorithms☆53May 13, 2019Updated 6 years ago
- 😈 Train ViZDoom agents by Reinforcement Learning 👻☆12Dec 5, 2017Updated 8 years ago
- Python keras + tensorflow implementation of DDPG solving modified open gymAI pendulum-v0 environment☆14Dec 27, 2021Updated 4 years ago
- Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch☆632Aug 13, 2018Updated 7 years ago
- ☆10Sep 3, 2021Updated 4 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 6 years ago
- Reimplementation of DDPG(Continuous Control with Deep Reinforcement Learning) based on OpenAI Gym + Tensorflow☆574Sep 28, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ICML 2018 Self-Imitation Learning☆275Apr 18, 2020Updated 6 years ago
- Deep reinforcement learning for recommendation system☆185Jul 1, 2019Updated 6 years ago
- This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Duel…☆694Dec 18, 2025Updated 4 months ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆313Feb 16, 2021Updated 5 years ago
- A simple, continuous-control environment for OpenAI Gym☆23Jan 1, 2023Updated 3 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow☆103Aug 3, 2020Updated 5 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆11May 29, 2021Updated 4 years ago