nikhil3456 / Deep-Reinforcement-Learning-in-Large-Discrete-Action-SpacesView external linksLinks
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, Ben Coppin).
☆70Nov 28, 2019Updated 6 years ago
Alternatives and similar repositories for Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
Users that are interested in Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces are comparing it to the libraries listed below
Sorting:
- Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…☆66Dec 7, 2022Updated 3 years ago
- Implementation of the algorithm in Python 3, TensorFlow and OpenAI Gym☆178Mar 1, 2018Updated 7 years ago
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- Re-implementation of Progressive Neural Networks with PyTorch☆15Jul 25, 2024Updated last year
- We propose a self-driving approach to online index selection that eschews the DBA and query optimiser, and instead learns the benefits of…☆13Jan 8, 2023Updated 3 years ago
- ☆12Nov 1, 2019Updated 6 years ago
- The Customer Care Bot is a cutting-edge customer support solution designed to revolutionize the way e-commerce websites interact with and…☆12Oct 4, 2023Updated 2 years ago
- BranchingDQN☆51Jan 30, 2019Updated 7 years ago
- MovieLens recommendation system using reinforcement learning (GYM + PPO)☆50Jul 8, 2020Updated 5 years ago
- Resource Management with DeepRL using TF Agents☆15Jul 27, 2020Updated 5 years ago
- Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and obse…☆19Jan 24, 2023Updated 3 years ago
- A primer to Uniform Manifold Approximation and Projection (UMAP)☆15Jun 1, 2018Updated 7 years ago
- A framework for evolving and testing question-answering datasets with various models.☆21Feb 28, 2024Updated last year
- [ICML'23] Official PyTorch Implementation of NA2Q, and a comprehensive benchmark based on pymarl☆21Jan 14, 2024Updated 2 years ago
- Defending Bicyclists from Erratic Drivers with Computer Vision and mmWave Radar☆18Apr 22, 2024Updated last year
- This repository provides simulator codes for predicting and tracking popular discussion threads on Reddit☆20Sep 10, 2016Updated 9 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated last year
- Companion code for the paper "Learnable Uncertainty under Laplace Approximations" (UAI 2021).☆20Jun 8, 2021Updated 4 years ago
- Sources for OpenCL and CUDA tutorials. http://jlaning.com☆20Jan 9, 2016Updated 10 years ago
- ☆22Jan 14, 2021Updated 5 years ago
- Code for CVPR-W 2020 paper "Hierarchical Image Classification using Entailment Cone Embeddings" https://arxiv.org/abs/2004.03459☆22Feb 2, 2023Updated 3 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 4 years ago
- In this work, we propose a novel formulation titled Federated Deep Q Networks (F-DQN) to perform distributed learning for Deep RL algorit…☆21Dec 25, 2020Updated 5 years ago
- Computational statistics and machine learning reading group at Imperial College London (2019-2020)☆25Jan 9, 2026Updated last month
- Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…☆96Mar 1, 2021Updated 4 years ago
- Code used for the Arvix report: The Case for Automatic Database Administration using Deep Reinforcement Learning☆25May 13, 2020Updated 5 years ago
- Data Structures with Python(AIX20001) 강의 자료실☆18Jun 14, 2024Updated last year
- This Python project integrates MetaTrader5 with GPT-4 to generate automated trading signals. It analyzes OHLC and tick data to provide re…☆12Aug 25, 2024Updated last year
- PyTorch implementation of SAC-Discrete.☆314Jul 25, 2024Updated last year
- Pytorch implementation of Soft Actor-Critic☆20Apr 13, 2020Updated 5 years ago
- ☆13Dec 4, 2025Updated 2 months ago
- A Terminal User Interface (TUI) application that enables interactive conversations with your documents using Large Language Models (LLM) …☆13Dec 11, 2024Updated last year
- Estimate probability of failure using reframed Bayesian optimization☆10Aug 14, 2025Updated 6 months ago
- Unveiling the Economics of SQL Operations☆10Apr 21, 2024Updated last year
- ☆35Aug 17, 2022Updated 3 years ago
- Deep reinforcement learning for REsource Allocation in streaM processing☆31Apr 30, 2023Updated 2 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py).☆33Aug 14, 2022Updated 3 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29May 12, 2025Updated 9 months ago
- Datasets and code to accompany Briceno-Mena, Luis A. and Venugopalan, Gokul and Romagnoli, José A. and Arges, Christopher G., Machine Lea…☆10Oct 17, 2022Updated 3 years ago