yashchandak / OptFuture_NSMDP
Optimizing for the Future in Non-Stationary MDPs
☆10 · Updated 2 years ago
Alternatives and similar repositories for OptFuture_NSMDP:
Users who are interested in OptFuture_NSMDP are comparing it to the repositories listed below.
- Code for Expert Supervised Reinforcement Learning ☆10 · Updated 4 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method. ☆11 · Updated 5 years ago
- ☆14 · Updated 5 years ago
- Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar. ☆45 · Updated 4 years ago
- ☆43 · Updated 3 years ago
- ☆15 · Updated 4 years ago
- Dead-ends and Secure Exploration in Reinforcement Learning ☆11 · Updated 5 years ago
- PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020 ☆40 · Updated 4 years ago
- An official JAX-based code for our NeuraLCB paper, "Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization", ICLR… ☆14 · Updated 3 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning ☆16 · Updated 3 years ago
- ☆32 · Updated 8 months ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation ☆40 · Updated 5 months ago
- Representation Learning in RL ☆16 · Updated 2 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization ☆24 · Updated 4 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy" ☆34 · Updated 5 years ago
- Bayes-Adaptive Monte-Carlo Planning algorithm ☆16 · Updated 12 years ago
- Deconfounding Reinforcement Learning in Observational Settings ☆51 · Updated 5 years ago
- Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... … ☆16 · Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Updated 5 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022. ☆17 · Updated 3 years ago
- Ordered Preference Elicitation Strategies for Multi-Objective Decision Making using Gaussian Processes ☆23 · Updated 6 years ago
- ☆17 · Updated 6 years ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆53 · Updated 4 years ago
- Implicit Normalizing Flows + Reinforcement Learning ☆60 · Updated 5 years ago
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning) ☆13 · Updated last year
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards ☆28 · Updated 5 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper. ☆38 · Updated 4 years ago
- Implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies ☆11 · Updated 4 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019. ☆17 · Updated 5 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT) ☆28 · Updated 2 years ago