Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatible.
☆65Dec 7, 2022Updated 3 years ago
Alternatives and similar repositories for wolpertinger_ddpg
Users that are interested in wolpertinger_ddpg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆176Mar 1, 2018Updated 8 years ago
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…☆69Nov 28, 2019Updated 6 years ago
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 8 years ago
- In Progress : State of the art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in pytorch.☆19Jun 15, 2018Updated 8 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆11May 29, 2021Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆122Feb 3, 2023Updated 3 years ago
- BranchingDQN☆51Jan 30, 2019Updated 7 years ago
- OAI Network Service in OSM☆12Sep 13, 2025Updated 9 months ago
- Explore the potential of recommendation system using reinforcement learning☆15Apr 23, 2020Updated 6 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated 2 years ago
- Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)☆53Apr 29, 2026Updated 2 months ago
- ☆10Aug 14, 2020Updated 5 years ago
- Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch☆632Aug 13, 2018Updated 7 years ago
- Official Implementation of Multi-Masked Aggregators for Graph Neural Networks in Pytorch and PyTorch Geometric☆11Mar 24, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆11Feb 28, 2023Updated 3 years ago
- Deep reinforcement learning for REsource Allocation in streaM processing☆30Apr 30, 2023Updated 3 years ago
- Pytorch implementation of Soft Actor-Critic☆20Apr 13, 2020Updated 6 years ago
- Resource Management with DeepRL using TF Agents☆16Jul 27, 2020Updated 5 years ago
- Re-implementation of Exploiting Edge Features in Graph Neural Networks☆11Apr 7, 2022Updated 4 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Mar 14, 2021Updated 5 years ago
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆97Mar 1, 2021Updated 5 years ago
- An extension of deeplab-v2 (in TF) allowing for smoothed dilated convolutions☆12Mar 27, 2019Updated 7 years ago
- ☆19Mar 5, 2018Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 5 years ago
- Artifact for paper "Chronosymbolic: Efficient CHC Solving with Symbolic Reasoning and Inductive Learning" in Python☆11Aug 4, 2024Updated last year
- A bipedal humanoid control system using a Physics-Informed Neural Network (PINN) and Reinforcement Learning (RL) for stability and manipu…☆13Mar 25, 2026Updated 3 months ago
- Multi-Agent training using Deep Deterministic Policy Gradient Networks, Solving the Tennis Environment☆11Oct 20, 2018Updated 7 years ago
- Implementation for ACER in tensorflow and sonnet by deepmind☆11Aug 28, 2017Updated 8 years ago
- Supporting code for "Learning to Solve Combinatorial Graph Partitioning Problems via Efficient Exploration".☆13Jun 18, 2022Updated 4 years ago
- ☆10Sep 9, 2022Updated 3 years ago
- ☆18Apr 17, 2019Updated 7 years ago
- Meta-Reinforcement Learning with Policy Residual Representation☆11Aug 15, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆89Dec 8, 2022Updated 3 years ago
- 关于AI,ML,DA,DV等的几个经典案例,包括堵车模拟(NagelSchreckenberg)、蒙特卡洛排队问题(Monte Carlo Queuing Problem)、人脸识别(RecognitionFace)、遗传算法推断图像(IconGenetic)☆10Oct 14, 2018Updated 7 years ago
- A collection of DPP code and other diverse sampling algorithms☆10Nov 12, 2014Updated 11 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Aug 23, 2018Updated 7 years ago
- CS234 Reinforcement Learning: Keras implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455)☆10Jun 10, 2017Updated 9 years ago
- fork from https://github.com/scutan90/DeepLearning-500-questions☆11Jan 3, 2019Updated 7 years ago
- learning to play atari games with reinforcement learning☆10Jan 4, 2016Updated 10 years ago