justinjfu / diagnosing_qlearning
Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.
☆19 · May 14, 2019 · Updated 6 years ago
Alternatives and similar repositories for diagnosing_qlearning
Users who are interested in diagnosing_qlearning are comparing it to the libraries listed below.
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method. ☆11 · Jun 12, 2019 · Updated 6 years ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning ☆12 · Jun 13, 2019 · Updated 6 years ago
- ☆10 · Aug 17, 2022 · Updated 3 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning" ☆12 · Nov 22, 2022 · Updated 3 years ago
- ☆14 · Jun 26, 2019 · Updated 6 years ago
- ☆13 · May 15, 2025 · Updated 8 months ago
- Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021) ☆54 · Jul 7, 2021 · Updated 4 years ago
- ☆88 · Jul 30, 2024 · Updated last year
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning ☆34 · Oct 28, 2020 · Updated 5 years ago
- P3O paper code ☆30 · Aug 7, 2019 · Updated 6 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning ☆14 · Dec 1, 2019 · Updated 6 years ago
- ☆42 · May 11, 2022 · Updated 3 years ago
- Collection of reinforcement learning algorithms ☆16 · Oct 6, 2021 · Updated 4 years ago
- ☆17 · Dec 21, 2020 · Updated 5 years ago
- ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies ☆18 · Jul 16, 2020 · Updated 5 years ago
- ICLR 2020 ☆20 · Feb 18, 2020 · Updated 5 years ago
- ☆53 · Feb 16, 2022 · Updated 3 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021) ☆20 · Oct 25, 2021 · Updated 4 years ago
- ☆24 · Feb 16, 2022 · Updated 3 years ago
- Ranking Policy Gradient ☆23 · Nov 27, 2019 · Updated 6 years ago
- This is the repository for paper "Improving Sepsis Treatment Strategies using Deep Reinforcement Learning and Mixture-of-Experts" ☆27 · Jul 6, 2018 · Updated 7 years ago
- Safe Policy Improvement with Baseline Bootstrapping ☆26 · May 5, 2020 · Updated 5 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model ☆154 · Oct 26, 2020 · Updated 5 years ago
- ☆23 · Jun 8, 2021 · Updated 4 years ago
- A library for building reinforcement learning and imitation learning agents in Pytorch ☆62 · Jun 13, 2020 · Updated 5 years ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult… ☆30 · Oct 26, 2022 · Updated 3 years ago
- ☆24 · Nov 10, 2020 · Updated 5 years ago
- ☆26 · Mar 16, 2023 · Updated 2 years ago
- OPE Tools based on Empirical Study of Off Policy Policy Estimation paper. ☆62 · Aug 9, 2022 · Updated 3 years ago
- Revisiting Rainbow ☆75 · Jun 9, 2021 · Updated 4 years ago
- ☆202 · Mar 25, 2023 · Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Feb 21, 2020 · Updated 5 years ago
- Fast Flexible Replay Buffer Library (Mirror repository of https://gitlab.com/ymd_h/cpprb) ☆74 · Dec 14, 2024 · Updated last year
- Implementation of advantage-weighted regression. ☆207 · May 30, 2020 · Updated 5 years ago
- Unofficial Re-implementation of "Dream to Control: Learning Behaviors by Latent Imagination" (https://arxiv.org/abs/1912.01603) with PyT… ☆32 · Aug 22, 2020 · Updated 5 years ago
- ☆30 · Sep 3, 2019 · Updated 6 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆87 · Jan 24, 2024 · Updated 2 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning ☆34 · Feb 1, 2020 · Updated 6 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning ☆130 · Mar 21, 2021 · Updated 4 years ago