shreyasc-13 / correlated_bandits
Code for the paper "Multi-Armed Bandits with Correlated Arms"
☆11Updated 3 years ago
Alternatives and similar repositories for correlated_bandits:
Users who are interested in correlated_bandits are comparing it to the repositories listed below:
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆188Updated 2 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆352Updated 3 years ago
- Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning☆94Updated last year
- Constrained Policy Optimization☆317Updated 7 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆196Updated 6 months ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆166Updated 2 years ago
- Experiments with reinforcement learning and recurrent neural networks☆113Updated last year
- Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'☆60Updated 3 years ago
- Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023☆25Updated last year
- PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…☆70Updated 5 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆487Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆162Updated 4 months ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 8 months ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆92Updated last week
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆297Updated 2 years ago
- Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC☆98Updated 5 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆101Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆177Updated 2 years ago
- DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details☆46Updated 2 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020)☆103Updated 2 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆55Updated 3 weeks ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆160Updated 4 years ago
- Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms☆152Updated last year
- Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"☆205Updated 5 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆239Updated 4 years ago
- PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…☆87Updated 2 years ago