ChangyWen/wolpertinger_ddpg

Wolpertinger Training with DDPG (PyTorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Single-GPU/CPU compatible.

☆66

Alternatives and similar repositories for wolpertinger_ddpg

Users who are interested in wolpertinger_ddpg are comparing it to the libraries listed below.

jimkon / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
☆178 · Mar 1, 2018 · Updated 8 years ago
nikhil3456 / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…
☆70 · Nov 28, 2019 · Updated 6 years ago
TomZahavy / CB_AE_DQN
Contextual Bandits Action Elimination DQN
☆21 · Jun 25, 2018 · Updated 7 years ago
zoulixin93 / FMCTS
☆11 · Feb 22, 2019 · Updated 7 years ago
hercky / ACER_tf
Implementation for ACER in tensorflow and sonnet by deepmind
☆11 · Aug 28, 2017 · Updated 8 years ago
atavakol / action-branching-agents
(AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning
☆121 · Feb 3, 2023 · Updated 3 years ago
wulfebw / playing_atari
learning to play atari games with reinforcement learning
☆10 · Jan 4, 2016 · Updated 10 years ago
dion-jy / gym-td3-keras
Keras Implementation of TD3 (Twin Delayed DDPG) with PER (Prioritized Experience Replay) option on OpenAI gym framework
☆11 · May 29, 2021 · Updated 4 years ago
mcd4874 / Recommendation_system_using_RL_RecSim
Explore the potential of recommendation system using reinforcement learning
☆15 · Apr 23, 2020 · Updated 5 years ago
apexrl / CoDAIL
Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>
☆19 · Jun 17, 2021 · Updated 4 years ago
jvking / reddit-RL-simulator
This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit
☆20 · Sep 10, 2016 · Updated 9 years ago
crowdAI / marlo-single-agent-starter-kit
Round 1 Starter Kit for the MarLo challenge
☆21 · Sep 27, 2018 · Updated 7 years ago
sahandrez / homomorphic_policy_gradient
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆24 · Apr 8, 2024 · Updated last year
ajgupta93 / d4pg-pytorch
In progress: state-of-the-art Distributed Distributional Deep Deterministic Policy Gradient algorithm implementation in PyTorch.
☆19 · Jun 15, 2018 · Updated 7 years ago
Guneet-Dhillon / Stochastic-Activation-Pruning
☆19 · Mar 5, 2018 · Updated 8 years ago
moduIo / Deep-Q-network
Keras implementation of DQN for the MsPacman-v0 OpenAI Gym environment.
☆37 · Dec 8, 2022 · Updated 3 years ago
lns / memoire
☆18 · Apr 17, 2019 · Updated 6 years ago
go2sea / NoisyNetDQN
Tensorflow implementation for "Noisy network for exploration"
☆19 · Aug 2, 2017 · Updated 8 years ago
ghliu / pytorch-ddpg
Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch
☆630 · Aug 13, 2018 · Updated 7 years ago
zplizzi / pytorch-ppo
Simple, readable, yet full-featured implementation of PPO in Pytorch
☆51 · Apr 25, 2025 · Updated 10 months ago
unixpickle / uno-ai
AI for the game Uno
☆17 · Aug 6, 2019 · Updated 6 years ago
BetsyHJ / SOFA
A TensorFlow implementation of SOFA, the Simulator for OFfline LeArning and evaluation.
☆21 · Nov 29, 2020 · Updated 5 years ago
TheMTank / cups-rl
Customisable Unified Physical Simulations (CUPS) for Reinforcement Learning. Experiments run on the ai2thor environment (http://ai2thor.a…
☆51 · Mar 9, 2020 · Updated 6 years ago
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆88 · Dec 8, 2022 · Updated 3 years ago
Jonathan-Pearce / DDPG_PER
Implementation of Deep Deterministic Policy Gradient (DDPG) with Prioritized Experience Replay (PER)
☆54 · Feb 25, 2025 · Updated last year
Rowing0914 / Reinforcement_Learning
Research repo of RL
☆23 · Mar 25, 2023 · Updated 2 years ago
bkj / pbt
Population Based Training, Figure 2
☆25 · Dec 2, 2017 · Updated 8 years ago
xwhan / walk_the_blocks
Implementation of Scheduled Policy Optimization for task-oriented language grounding
☆29 · Jul 16, 2018 · Updated 7 years ago
MoMe36 / BranchingDQN
BranchingDQN
☆51 · Jan 30, 2019 · Updated 7 years ago
wulfebw / muzero
A Python implementation of tabular MuZero for educational purposes
☆21 · Dec 11, 2019 · Updated 6 years ago
LxzGordon / Deep-Reinforcement-Learning-with-pytorch
Basic reinforcement learning algorithms. Including: DQN, Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic, DDPG, D…
☆96 · Mar 1, 2021 · Updated 5 years ago
clvoloshin / constrained_batch_policy_learning
☆27 · Oct 25, 2019 · Updated 6 years ago
toshikwa / sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
☆314 · Jul 25, 2024 · Updated last year
samlanka / DDPG-PyTorch
Deep Deterministic Policy Gradient implemented in PyTorch for DeepMind Control Suite
☆25 · Oct 11, 2018 · Updated 7 years ago
xlnwel / model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x
☆62 · Apr 5, 2021 · Updated 4 years ago
luozachary / drl-rec
Deep reinforcement learning for recommendation system
☆186 · Jul 1, 2019 · Updated 6 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆76 · Jun 9, 2021 · Updated 4 years ago
xiangni / DREAM
Deep reinforcement learning for REsource Allocation in streaM processing
☆30 · Apr 30, 2023 · Updated 2 years ago
clvrai / FeatureControlHRL-Tensorflow
A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning
☆32 · Oct 12, 2017 · Updated 8 years ago