Wolpertinger Training with DDPG (PyTorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Single-GPU/CPU compatible.
☆66Dec 7, 2022Updated 3 years ago
Alternatives and similar repositories for wolpertinger_ddpg
Users that are interested in wolpertinger_ddpg are comparing it to the libraries listed below.
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆178Mar 1, 2018Updated 8 years ago
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…☆70Nov 28, 2019Updated 6 years ago
- Python implementation of TPGR☆40Mar 27, 2019Updated 7 years ago
- In progress: state-of-the-art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in PyTorch.☆19Jun 15, 2018Updated 7 years ago
- Keras implementation of TD3 (Twin Delayed DDPG) with an optional PER (Prioritized Experience Replay) on the OpenAI Gym framework☆11May 29, 2021Updated 4 years ago
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆121Feb 3, 2023Updated 3 years ago
- BranchingDQN☆51Jan 30, 2019Updated 7 years ago
- Explore the potential of recommendation system using reinforcement learning☆15Apr 23, 2020Updated 5 years ago
- Round 1 Starter Kit for the MarLo challenge☆21Sep 27, 2018Updated 7 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated last year
- Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>☆19Jun 17, 2021Updated 4 years ago
- Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch☆631Aug 13, 2018Updated 7 years ago
- just for fun☆14Mar 11, 2018Updated 8 years ago
- Tools for manipulating CHC and related files☆15Apr 21, 2023Updated 2 years ago
- Deep reinforcement learning for REsource Allocation in streaM processing☆30Apr 30, 2023Updated 2 years ago
- Repository for the code used to create the results of the paper "MaMIMO CSI-based positioning using CNNs: Peeking inside the black box"☆11Jan 12, 2024Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic☆20Apr 13, 2020Updated 5 years ago
- MuJoCo benchmark for Deep Reinforcement Learning as provided by Tianshou framework.☆15Jan 12, 2025Updated last year
- Resource Management with DeepRL using TF Agents☆16Jul 27, 2020Updated 5 years ago
- Re-implementation of Exploiting Edge Features in Graph Neural Networks☆11Apr 7, 2022Updated 3 years ago
- Minimal end-to-end LTE using srsRAN. Dockerized and emulated radio over shared memory.☆11Jun 7, 2021Updated 4 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Mar 14, 2021Updated 5 years ago
- An extension of deeplab-v2 (in TF) allowing for smoothed dilated convolutions☆12Mar 27, 2019Updated 7 years ago
- ☆19Mar 5, 2018Updated 8 years ago
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 5 years ago
- A bipedal humanoid control system using a Physics-Informed Neural Network (PINN) and Reinforcement Learning (RL) for stability and manipu…☆12Updated this week
- A Python implementation of the SARSA λ reinforcement learning algorithm☆12Mar 6, 2019Updated 7 years ago
- Multi-Agent training using Deep Deterministic Policy Gradient Networks, Solving the Tennis Environment☆11Oct 20, 2018Updated 7 years ago
- Implementation of ACER in TensorFlow and Sonnet by DeepMind☆11Aug 28, 2017Updated 8 years ago
- Supporting code for "Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration".☆13Jun 18, 2022Updated 3 years ago
- ☆10Sep 9, 2022Updated 3 years ago
- ☆18Apr 17, 2019Updated 6 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- Implementation of a maximum area coverage algorithm in MATLAB☆14Dec 27, 2020Updated 5 years ago
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆88Dec 8, 2022Updated 3 years ago
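- Artificial intelligence: solving the eight-puzzle and eight-queens problems with hill climbing, random-restart hill climbing, simulated annealing, genetic algorithms, and heuristic search☆11Jul 15, 2021Updated 4 years ago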
- A collection of DPP code and other diverse sampling algorithms☆10Nov 12, 2014Updated 11 years ago
- ☆11May 27, 2019Updated 6 years ago