PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, Ben Coppin).
☆70Nov 28, 2019Updated 6 years ago
Alternatives and similar repositories for Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Users that are interested in Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…☆66Dec 7, 2022Updated 3 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆178Mar 1, 2018Updated 8 years ago
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- BranchingDQN☆51Jan 30, 2019Updated 7 years ago
- ☆11Feb 22, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- MovieLens recommendation system using reinforcement learning (GYM + PPO)☆50Jul 8, 2020Updated 5 years ago
- Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch☆632Aug 13, 2018Updated 7 years ago
- Re-implementation of Progressive Neural Networks with PyTorch☆15Jul 25, 2024Updated last year
- ☆12Nov 1, 2019Updated 6 years ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch☆27Feb 11, 2025Updated last year
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Feb 5, 2018Updated 8 years ago
- A pytorch reimplementation of KL-Loss (CVPR'2019)☆15Oct 15, 2023Updated 2 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Apr 3, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- In this work, we propose a novel formulation titled Federated Deep Q Networks (F-DQN) to perform distributed learning for Deep RL algorit…☆21Dec 25, 2020Updated 5 years ago
- PyTorch implementation of SAC-Discrete.☆316Jul 25, 2024Updated last year
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆121Feb 3, 2023Updated 3 years ago
- [ICML'23] Official PyTorch Implementation of NA2Q, and a comprehensive benchmark based on pymarl☆21Jan 14, 2024Updated 2 years ago
- Sources for OpenCL and CUDA tutorials. http://jlaning.com☆20Jan 9, 2016Updated 10 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆96Mar 1, 2021Updated 5 years ago
- Companion code for the paper "Learnable Uncertainty under Laplace Approximations" (UAI 2021).☆19Jun 8, 2021Updated 4 years ago
- Supporting code for "Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration".☆13Jun 18, 2022Updated 3 years ago
- 5G assisted software defined vehicular network for cooperative data sharing☆11Oct 8, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated 2 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- Distributed & asynchronous DQN implementation using gRPC and PyTorch.☆10Feb 15, 2021Updated 5 years ago
- This repository is the implementation of the paper "Beating Atari with Natural Language Guided Reinforcement Learning"☆11Nov 25, 2018Updated 7 years ago
- ☆11May 27, 2019Updated 6 years ago
- Deep reinforcement learning for REsource Allocation in streaM processing☆30Apr 30, 2023Updated 2 years ago
- ☆14Dec 14, 2022Updated 3 years ago
- Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and obse…☆19Jan 24, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- I have developed a custom environment using OpenAI Gym in Python for simulating a 5G wireless communication channel as part of a reinforc…☆13Mar 27, 2024Updated 2 years ago
- ☆22Jan 14, 2021Updated 5 years ago
- 一个基于图神经网络的强化学习网络资源分配模型☆31Mar 14, 2022Updated 4 years ago
- ☆34Aug 17, 2022Updated 3 years ago
- ☆12Jun 17, 2022Updated 3 years ago
- Code for the paper "Optimal Off-Policy Evaluation from Multiple Logging Policies"☆15Jul 17, 2021Updated 4 years ago
- Incremental Mobile User Profiling: Reinforcement Learning with Spatial Knowledge Graph for Modeling Event Streams☆15Jul 25, 2024Updated last year