shreyasc-13 / correlated_bandits

Code for the paper Multi-Armed Bandits with Correlated Arms

☆11

Alternatives and similar repositories for correlated_bandits:

Users that are interested in correlated_bandits are comparing it to the libraries listed below

quanvuong / handful-of-trials-pytorch
Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆188Updated 2 years ago
sfujim / TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆352Updated 3 years ago
axelabels / DynMORL
Code for Dynamic Weights in Multi-Objective Deep Reinforcement Learning
☆94Updated last year
jachiam / cpo
Constrained Policy Optimization
☆317Updated 7 years ago
liuzuxin / OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
☆196Updated 6 months ago
Xingyu-Lin / mbpo_pytorch
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
☆166Updated 2 years ago
AntoineTheb / RNN-RL
Experiments with reinforcement learning and recurrent neural networks
☆113Updated last year
lucaslingle / pytorch_rl2
Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'
☆60Updated 3 years ago
remosasso / PSDRL
Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023
☆25Updated last year
MadryLab / implementation-matters
☆127Updated 8 months ago
nikhil3456 / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…
☆70Updated 5 years ago
jannerm / mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆487Updated 2 years ago
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆162Updated 4 months ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆133Updated 8 months ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆92Updated last week
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆297Updated 2 years ago
cyoon1729 / Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆98Updated 5 years ago
evgenii-nikishin / rl_with_resets
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
☆101Updated 2 years ago
tianheyu927 / mopo
Code for MOPO: Model-based Offline Policy Optimization
☆177Updated 2 years ago
vwxyzjn / PPO-Implementation-Deep-Dive
DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details
☆46Updated 2 years ago
qian18long / epciclr2020
☆120Updated 2 years ago
amazon-science / meta-q-learning
Code for the paper "Meta-Q-Learning"( ICLR 2020)
☆103Updated 2 years ago
activatedgeek / qmix
☆26Updated 6 years ago
eric-mitchell / macaw
Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]
☆47Updated 2 years ago
johannah / bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
☆55Updated 3 weeks ago
aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆160Updated 4 years ago
vwxyzjn / invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
☆152Updated last year
cycraig / MP-DQN
Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"
☆205Updated 5 years ago
denisyarats / pytorch_sac_ae
PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)
☆239Updated 4 years ago
BY571 / IQN-and-Extensions
PyTorch Implementation of Implicit Quantile Networks (IQN) for Distributional Reinforcement Learning with additional extensions like PER,…
☆87Updated 2 years ago