Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆16Sep 24, 2019Updated 6 years ago
Alternatives and similar repositories for Episodic-Backward-Update
Users that are interested in Episodic-Backward-Update are comparing it to the libraries listed below
Sorting:
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>☆23Mar 24, 2023Updated 2 years ago
- Code for Policy Consolidation for Continual Reinforcement Learning☆10May 12, 2019Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- ☆35Jul 10, 2021Updated 4 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Feb 1, 2020Updated 6 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Sep 15, 2021Updated 4 years ago
- ☆14May 31, 2022Updated 3 years ago
- ☆18Apr 11, 2024Updated last year
- ☆17Sep 28, 2023Updated 2 years ago
- ☆18Jan 3, 2022Updated 4 years ago
- ☆22Oct 4, 2019Updated 6 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26May 5, 2020Updated 5 years ago
- Model-Based Offline Reinforcement Learning☆52Jan 13, 2021Updated 5 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆113May 27, 2024Updated last year
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆28Jan 12, 2023Updated 3 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆27Jul 19, 2023Updated 2 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Sep 5, 2018Updated 7 years ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.☆25May 20, 2024Updated last year
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]☆39Aug 27, 2024Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆30Sep 30, 2022Updated 3 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆35Mar 6, 2021Updated 5 years ago
- Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.☆33Jan 23, 2021Updated 5 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆31Jun 24, 2018Updated 7 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- Personal Repo to keep track of RL papers☆31May 3, 2021Updated 4 years ago
- ☆42May 11, 2022Updated 3 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆34Feb 16, 2020Updated 6 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆156Aug 31, 2021Updated 4 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- Material complementar para o curso sobre Node-RED☆10Sep 5, 2019Updated 6 years ago
- This is my implementation of the Optimality Tightening☆37Apr 26, 2017Updated 8 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆163Jul 17, 2020Updated 5 years ago
- ☆92Dec 5, 2023Updated 2 years ago
- all-paths graph kernel for protein-protein interaction extraction☆12Apr 22, 2014Updated 11 years ago
- [AAAI 2023 Oral] Official code for "PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction".☆21Jul 26, 2025Updated 7 months ago
- Pytorch Implementation of ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation (https://arxiv.org/abs/1606.02147)☆11Jan 24, 2020Updated 6 years ago