PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, Ben Coppin).
☆70Nov 28, 2019Updated 6 years ago
Alternatives and similar repositories for Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Users that are interested in Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces are comparing it to the libraries listed below
Sorting:
- Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…☆66Dec 7, 2022Updated 3 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆178Mar 1, 2018Updated 8 years ago
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- Soft Actor-Critic implementation with SOTA model-free extension (REDQ) and SOTA model-based extension (MBPO).☆15Feb 21, 2021Updated 5 years ago
- Re-implementation of Progressive Neural Networks with PyTorch☆15Jul 25, 2024Updated last year
- The Customer Care Bot is a cutting-edge customer support solution designed to revolutionize the way e-commerce websites interact with and…☆12Oct 4, 2023Updated 2 years ago
- We propose a self-driving approach to online index selection that eschews the DBA and query optimiser, and instead learns the benefits of…☆13Jan 8, 2023Updated 3 years ago
- MovieLens recommendation system using reinforcement learning (GYM + PPO)☆50Jul 8, 2020Updated 5 years ago
- A pytorch reimplementation of KL-Loss (CVPR'2019)☆15Oct 15, 2023Updated 2 years ago
- Resource Management with DeepRL using TF Agents☆16Jul 27, 2020Updated 5 years ago
- Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and obse…☆19Jan 24, 2023Updated 3 years ago
- This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit☆20Sep 10, 2016Updated 9 years ago
- [ICML'23] Official PyTorch Implementation of NA2Q, and a comprehensive benchmark based on pymarl☆21Jan 14, 2024Updated 2 years ago
- ☆22Jan 14, 2021Updated 5 years ago
- Companion code for the paper "Learnable Uncertainty under Laplace Approximations" (UAI 2021).☆20Jun 8, 2021Updated 4 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated last year
- (AAAI 2018) Action Branching Architectures for Deep Reinforcement Learning☆121Feb 3, 2023Updated 3 years ago
- A C++ library to benchmark inverted indexes.☆21Aug 4, 2020Updated 5 years ago
- Code for CVPR-W 2020 paper "Hierarchical Image Classification using Entailment Cone Embeddings" https://arxiv.org/abs/2004.03459☆22Feb 2, 2023Updated 3 years ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Dec 8, 2022Updated 3 years ago
- ☆27Oct 25, 2019Updated 6 years ago
- Code used for the Arvix report: The Case for Automatic Database Administration using Deep Reinforcement Learning☆25May 13, 2020Updated 5 years ago
- Pytorch implementation of Soft Actor-Critic☆20Apr 13, 2020Updated 5 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- An experiment to see if we can process G2 reviews to extract topics from reviews☆10Feb 5, 2024Updated 2 years ago
- ☆30Aug 11, 2022Updated 3 years ago
- Hierarchical Bayesian inversion of electrochemical impedance spectroscopy (EIS) data☆12Jan 12, 2025Updated last year
- Unveiling the Economics of SQL Operations☆10Apr 21, 2024Updated last year
- 一个基于图神经网络的强化学习网络资源分配模型☆32Mar 14, 2022Updated 3 years ago
- ☆35Aug 17, 2022Updated 3 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- ☆10Oct 31, 2021Updated 4 years ago
- Benchmarking of Session-based Recommendation Approaches☆30Oct 29, 2020Updated 5 years ago
- The best way to practice interview questions☆14Apr 25, 2023Updated 2 years ago
- ☆11May 25, 2023Updated 2 years ago
- A PyTorch implementation of DeepMind's MuZero agent☆37Dec 1, 2023Updated 2 years ago
- ☆42Jan 9, 2025Updated last year
- In this project, we give python and C++ codes for the Ring Polymer Molecular Dynamics (RMPD) to calculate the time correlation function(…☆12Dec 31, 2017Updated 8 years ago
- Pytorch implementation for image compression and reconstruction via autoencoder☆10Jun 17, 2020Updated 5 years ago