PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, Ben Coppin).
☆70Nov 28, 2019Updated 6 years ago
Alternatives and similar repositories for Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Users that are interested in Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces are comparing it to the libraries listed below.
- Wolpertinger Training with DDPG (PyTorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Single-GPU/CPU compatib…☆66Dec 7, 2022Updated 3 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆178Mar 1, 2018Updated 8 years ago
- ☆11Feb 22, 2019Updated 7 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- MovieLens recommendation system using reinforcement learning (GYM + PPO)☆50Jul 8, 2020Updated 5 years ago
- Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch☆635Aug 13, 2018Updated 7 years ago
- Re-implementation of Progressive Neural Networks with PyTorch☆15Jul 25, 2024Updated last year
- Scripts accompanying the paper "Reinforcement Learning for Sequential Assembly of SL-Blocks"☆13Jun 15, 2021Updated 4 years ago
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Feb 5, 2018Updated 8 years ago
- A pytorch reimplementation of KL-Loss (CVPR'2019)☆15Oct 15, 2023Updated 2 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Apr 3, 2020Updated 6 years ago
- PyTorch implementation of SAC-Discrete.☆316Jul 25, 2024Updated last year
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆121Feb 3, 2023Updated 3 years ago
- A Framework for Safe and Accelerated Reinforcement Learning-based Radio Resource Management☆20Oct 1, 2022Updated 3 years ago
- Sources for OpenCL and CUDA tutorials. http://jlaning.com☆20Jan 9, 2016Updated 10 years ago
- Simulation of multiple femtocell network, modeling interference and resource allocations☆17Feb 11, 2015Updated 11 years ago
- Companion code for the paper "Learnable Uncertainty under Laplace Approximations" (UAI 2021).☆19Jun 8, 2021Updated 4 years ago
- ☆21Apr 15, 2024Updated 2 years ago
- Multi-agent Reinforcement Learning Algorithms(COMA, VDN, QMIX)☆16May 24, 2020Updated 5 years ago
- [ICML'23] Official PyTorch Implementation of NA2Q, and a comprehensive benchmark based on pymarl☆22Jan 14, 2024Updated 2 years ago
- Supporting code for "Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration".☆13Jun 18, 2022Updated 3 years ago
- ☆10Sep 9, 2022Updated 3 years ago
- A C++ library to benchmark inverted indexes.☆21Aug 4, 2020Updated 5 years ago
- 5G assisted software defined vehicular network for cooperative data sharing☆11Oct 8, 2021Updated 4 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated 2 years ago
- ☆12Jan 4, 2024Updated 2 years ago
- Code for CVPR-W 2020 paper "Hierarchical Image Classification using Entailment Cone Embeddings" https://arxiv.org/abs/2004.03459☆22Feb 2, 2023Updated 3 years ago
- ☆13Oct 29, 2022Updated 3 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- This repository is the implementation of the paper "Beating Atari with Natural Language Guided Reinforcement Learning"☆11Nov 25, 2018Updated 7 years ago
- ☆11May 25, 2023Updated 2 years ago
- ☆11May 27, 2019Updated 6 years ago
- Implementing SDN based VANET simulated on Mininet-Wifi nodes - Socket programming to implement functionality as an application☆10Feb 22, 2019Updated 7 years ago
- PyTorch implementation of MuZero for Gym environments. It supports any Discrete, Box, and Box2D configuration for the action space and obse…☆19Jan 24, 2023Updated 3 years ago
- Reproduction of the paper "Reinforcement Learning of Sequential Price Mechanisms"☆12Nov 3, 2022Updated 3 years ago
- I have developed a custom environment using OpenAI Gym in Python for simulating a 5G wireless communication channel as part of a reinforc…☆13Mar 27, 2024Updated 2 years ago
- ☆22Jan 14, 2021Updated 5 years ago
- A reinforcement learning model for network resource allocation based on graph neural networks☆31Mar 14, 2022Updated 4 years ago
- ☆12Jun 17, 2022Updated 3 years ago