ambujtewari / stats701-winter2021
Theory of Reinforcement Learning
☆16Updated 3 years ago
Related projects: ⓘ
- ☆28Updated 3 months ago
- Learning Laplacian Representations in Reinforcement Learning☆17Updated 3 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆141Updated last year
- ☆85Updated last month
- ☆39Updated 3 years ago
- ☆115Updated last month
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).☆24Updated 2 years ago
- Benchmarked implementations of Offline RL Algorithms.☆62Updated 4 months ago
- ☆44Updated last year
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 2 years ago
- ☆26Updated 4 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆64Updated 2 years ago
- ☆53Updated 6 months ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆51Updated 3 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆68Updated 2 years ago
- Author's PyTorch implementation of LAP and PAL with TD3 and DDQN☆33Updated 2 years ago
- Model-Based Offline Reinforcement Learning☆47Updated 3 years ago
- Offline Reinforcement Learning Reading Group☆24Updated last year
- Paper Collection for Batch RL with brief introductions.☆84Updated 2 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper.☆61Updated 2 years ago
- ☆14Updated 4 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆100Updated 2 years ago
- Single Episode Policy Transfer in Reinforcement Learning☆17Updated 2 years ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆40Updated 3 months ago
- ☆28Updated 2 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆70Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆119Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆169Updated 2 years ago
- Experiment code for testing effect of various action space transformations in reinforcement learning☆28Updated 4 years ago