shamanez / VUSFA-Variational-Universal-Successor-Features-ApproximatorLinks
This repository contains implementations of the paper VUSFA
β14Updated 4 years ago
Alternatives and similar repositories for VUSFA-Variational-Universal-Successor-Features-Approximator
Users that are interested in VUSFA-Variational-Universal-Successor-Features-Approximator are comparing it to the libraries listed below
Sorting:
- π§Ά Minimal PyTorch Soft Actor Critic (SAC) implementationβ38Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradientsβ32Updated 5 years ago
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimationβ39Updated 8 months ago
- Generalised UDRLβ37Updated 3 years ago
- Disagreement-Regularized Imitation Learningβ30Updated 4 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewardsβ28Updated 6 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"β44Updated last year
- Variational Reinforcement Learningβ16Updated 11 months ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]β39Updated 2 years ago
- Code for Multi-Agent Common Knowledge Reinforcement Learning (NeurIPS 2019)β33Updated 5 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.β36Updated 4 years ago
- ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependenciesβ18Updated 4 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variablesβ73Updated 2 years ago
- β21Updated last year
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.β56Updated 6 years ago
- using information theory to encourage agents to cooperate and competeβ19Updated 6 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimizationβ24Updated 5 years ago
- Docker containers of baseline agents for the Crafter environmentβ28Updated 3 years ago
- Safe Option-Critic: Learning Safety in the Option-Critic Architectureβ20Updated 6 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Explorationβ68Updated 3 years ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environmentβ16Updated 7 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"β44Updated 2 years ago
- Implicit Normalizing Flows + Reinforcement Learningβ61Updated 6 years ago
- π΄ OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)β24Updated 4 years ago
- Implementation for paper "A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning".β59Updated 9 months ago
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICLβ¦β55Updated 4 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)β19Updated 2 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)β44Updated 2 years ago
- Implementation of Proximal Policy Optimization in Jax+Flaxβ19Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAXβ24Updated last year