PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, Ben Coppin).
☆70Nov 28, 2019Updated 6 years ago
Alternatives and similar repositories for Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Users that are interested in Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…☆66Dec 7, 2022Updated 3 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆177Mar 1, 2018Updated 8 years ago
- Resource Management with DeepRL using TF Agents☆16Jul 27, 2020Updated 5 years ago
- BranchingDQN☆51Jan 30, 2019Updated 7 years ago
- We propose a self-driving approach to online index selection that eschews the DBA and query optimiser, and instead learns the benefits of…☆13Jan 8, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Feb 22, 2019Updated 7 years ago
- MovieLens recommendation system using reinforcement learning (GYM + PPO)☆49Jul 8, 2020Updated 5 years ago
- Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch☆634Aug 13, 2018Updated 7 years ago
- OAI Network Service in OSM☆12Sep 13, 2025Updated 8 months ago
- A pytorch reimplementation of KL-Loss (CVPR'2019)☆15Oct 15, 2023Updated 2 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Apr 3, 2020Updated 6 years ago
- In this work, we propose a novel formulation titled Federated Deep Q Networks (F-DQN) to perform distributed learning for Deep RL algorit…☆21Dec 25, 2020Updated 5 years ago
- PyTorch implementation of SAC-Discrete.☆316Jul 25, 2024Updated last year
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆121Feb 3, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Sources for OpenCL and CUDA tutorials. http://jlaning.com☆20Jan 9, 2016Updated 10 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆97Mar 1, 2021Updated 5 years ago
- [ICML'23] Official PyTorch Implementation of NA2Q, and a comprehensive benchmark based on pymarl☆23Jan 14, 2024Updated 2 years ago
- Supporting code for "Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration".☆13Jun 18, 2022Updated 3 years ago
- ☆10Sep 9, 2022Updated 3 years ago
- A C++ library to benchmark inverted indexes.☆21Aug 4, 2020Updated 5 years ago
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated 2 years ago
- Code for CVPR-W 2020 paper "Hierarchical Image Classification using Entailment Cone Embeddings" https://arxiv.org/abs/2004.03459☆22Feb 2, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆27Oct 25, 2019Updated 6 years ago
- Distributed & asynchronous DQN implementation using gRPC and PyTorch.☆10Feb 15, 2021Updated 5 years ago
- Creative Idea; Semantic Communcation; SwinJSCC; not but not want publish☆23Aug 7, 2025Updated 9 months ago
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance☆17Sep 23, 2022Updated 3 years ago
- ☆11May 25, 2023Updated 3 years ago
- Deep reinforcement learning for REsource Allocation in streaM processing☆30Apr 30, 2023Updated 3 years ago
- ☆14Dec 14, 2022Updated 3 years ago
- Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and obse…☆19Jan 24, 2023Updated 3 years ago
- I have developed a custom environment using OpenAI Gym in Python for simulating a 5G wireless communication channel as part of a reinforc…☆14Mar 27, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 一个基于图神经网络的强化学习网络资源分配模型☆31Mar 14, 2022Updated 4 years ago
- ☆35Aug 17, 2022Updated 3 years ago
- ☆12Jun 17, 2022Updated 3 years ago
- Code for the paper "Optimal Off-Policy Evaluation from Multiple Logging Policies"☆15Jul 17, 2021Updated 4 years ago
- ☆18Jul 11, 2024Updated last year
- PPO with multi-head/autoregressive action outputs☆47Mar 4, 2021Updated 5 years ago
- Guarantee_Learning_Control☆11Sep 5, 2019Updated 6 years ago