nikhil3456/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nikhil3456/Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces)

nikhil3456 / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces

PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, Ben Coppin).

☆70

Alternatives and similar repositories for Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces

Users that are interested in Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces are comparing it to the libraries listed below

Sorting:

ChangyWen / wolpertinger_ddpg
View on GitHub
Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…
☆66Dec 7, 2022Updated 3 years ago
jimkon / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
View on GitHub
Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym
☆178Mar 1, 2018Updated 8 years ago
TomZahavy / CB_AE_DQN
View on GitHub
Contextual Bandits Action Elimination DQN
☆21Jun 25, 2018Updated 7 years ago
LucasAlegre / sac-plus
View on GitHub
Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).
☆15Feb 21, 2021Updated 5 years ago
hengdashi / pnn
View on GitHub
Re-implementation of Progressive Neural Networks with PyTorch
☆15Jul 25, 2024Updated last year
subhan97ahmed / customer-care-bot
View on GitHub
The Customer Care Bot is a cutting-edge customer support solution designed to revolutionize the way e-commerce websites interact with and…
☆12Oct 4, 2023Updated 2 years ago
malingaperera / DBABandits
View on GitHub
We propose a self-driving approach to online index selection that eschews the DBA and query optimiser, and instead learns the benefits of…
☆13Jan 8, 2023Updated 3 years ago
sadighian / recommendation-gym
View on GitHub
MovieLens recommendation system using reinforcement learning (GYM + PPO)
☆50Jul 8, 2020Updated 5 years ago
cxliu0 / KL-Loss-pytorch
View on GitHub
A pytorch reimplementation of KL-Loss (CVPR'2019)
☆15Oct 15, 2023Updated 2 years ago
tawfiqul-islam / RM_DeepRL
View on GitHub
Resource Management with DeepRL using TF Agents
☆16Jul 27, 2020Updated 5 years ago
DHDev0 / Muzero
View on GitHub
Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and obse…
☆19Jan 24, 2023Updated 3 years ago
jvking / reddit-RL-simulator
View on GitHub
This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit
☆20Sep 10, 2016Updated 9 years ago
zichuan-liu / NA2Q
View on GitHub
[ICML'23] Official PyTorch Implementation of NA2Q, and a comprehensive benchmark based on pymarl
☆21Jan 14, 2024Updated 2 years ago
StatsDLMathsRecomSys / Adversarial-Counterfactual-Learning-and-Evaluation-for-Recommender-System
View on GitHub
☆22Jan 14, 2021Updated 5 years ago
wiseodd / lula
View on GitHub
Companion code for the paper "Learnable Uncertainty under Laplace Approximations" (UAI 2021).
☆20Jun 8, 2021Updated 4 years ago
sahandrez / homomorphic_policy_gradient
View on GitHub
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆24Apr 8, 2024Updated last year
atavakol / action-branching-agents
View on GitHub
(AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning
☆121Feb 3, 2023Updated 3 years ago
jermp / 2i_bench
View on GitHub
A C++ library to benchmark inverted indexes.
☆21Aug 4, 2020Updated 5 years ago
ankitdhall / learning_embeddings
View on GitHub
Code for CVPR-W 2020 paper "Hierarchical Image Classification using Entailment Cone Embeddings" https://arxiv.org/abs/2004.03459
☆22Feb 2, 2023Updated 3 years ago
faameunier / MCTSnet
View on GitHub
A PyTorch implementation of DeepMind's MCTSnet
☆18Dec 8, 2022Updated 3 years ago
clvoloshin / constrained_batch_policy_learning
View on GitHub
☆27Oct 25, 2019Updated 6 years ago
shankur / autoindex
View on GitHub
Code used for the Arvix report: The Case for Automatic Database Administration using Deep Reinforcement Learning
☆25May 13, 2020Updated 5 years ago
kushagra06 / SAC
View on GitHub
Pytorch implementation of Soft Actor-Critic
☆20Apr 13, 2020Updated 5 years ago
rehbergT / dgemm
View on GitHub
Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…
☆13Nov 27, 2022Updated 3 years ago
balmasi / g2_reviews_llm_topic_modeling
View on GitHub
An experiment to see if we can process G2 reviews to extract topics from reviews
☆10Feb 5, 2024Updated 2 years ago
PredictiveIntelligenceLab / JAX-BO
View on GitHub
☆30Aug 11, 2022Updated 3 years ago
jdhuang-csm / bayes-drt2
View on GitHub
Hierarchical Bayesian inversion of electrochemical impedance spectroscopy (EIS) data
☆12Jan 12, 2025Updated last year
siddhantpathakk / postgreSQL-cost-estimator
View on GitHub
Unveiling the Economics of SQL Operations
☆10Apr 21, 2024Updated last year
Horn1998 / RL_GNN
View on GitHub
一个基于图神经网络的强化学习网络资源分配模型
☆32Mar 14, 2022Updated 3 years ago
linkpark / pomdp-service-migration
View on GitHub
☆35Aug 17, 2022Updated 3 years ago
JimOhman / model-based-rl
View on GitHub
Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).
☆33Aug 14, 2022Updated 3 years ago
FabianGabriel / Active_flow_control_past_cylinder_using_DRL
View on GitHub
☆10Oct 31, 2021Updated 4 years ago
mmaher22 / iCV-SBR
View on GitHub
Benchmarking of Session-based Recommendation Approaches
☆30Oct 29, 2020Updated 5 years ago
InterviewPal / InterviewPal
View on GitHub
The best way to practice interview questions
☆14Apr 25, 2023Updated 2 years ago
JFChi / PLUE
View on GitHub
☆11May 25, 2023Updated 2 years ago
michaelnny / muzero
View on GitHub
A PyTorch implementation of DeepMind's MuZero agent
☆37Dec 1, 2023Updated 2 years ago
ruvnet / inflight
View on GitHub
☆42Jan 9, 2025Updated last year
ks838 / Ring-Polymer-Molecular-Dynamics-in-Python-and-cpp
View on GitHub
In this project, we give python and C++ codes for the Ring Polymer Molecular Dynamics (RMPD) to calculate the time correlation function(…
☆12Dec 31, 2017Updated 8 years ago
micah35s / Autoencoder-Image-Compression
View on GitHub
Pytorch implementation for image compression and reconstruction via autoencoder
☆10Jun 17, 2020Updated 5 years ago