suyoung-lee / Episodic-Backward-UpdateView external linksLinks
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆16Sep 24, 2019Updated 6 years ago
Alternatives and similar repositories for Episodic-Backward-Update
Users that are interested in Episodic-Backward-Update are comparing it to the libraries listed below
Sorting:
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>☆23Mar 24, 2023Updated 2 years ago
- Code for Policy Consolidation for Continual Reinforcement Learning☆10May 12, 2019Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 5 years ago
- ☆35Jul 10, 2021Updated 4 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Feb 1, 2020Updated 6 years ago
- Model-based reinforcement learning (generative simulator models and planning agents)☆16Sep 15, 2021Updated 4 years ago
- ☆14May 31, 2022Updated 3 years ago
- ☆17Sep 28, 2023Updated 2 years ago
- ☆18Jan 3, 2022Updated 4 years ago
- ☆22Oct 4, 2019Updated 6 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- Safe Policy Improvement with Baseline Bootstrapping☆26May 5, 2020Updated 5 years ago
- Model-Based Offline Reinforcement Learning☆52Jan 13, 2021Updated 5 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆28Jan 12, 2023Updated 3 years ago
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆27Jul 19, 2023Updated 2 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Sep 5, 2018Updated 7 years ago
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]☆38Aug 27, 2024Updated last year
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.☆24May 20, 2024Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆30Sep 30, 2022Updated 3 years ago
- Benchmark data (i.e., DeepMind Control Suite and MuJoCo) for RL.☆33Jan 23, 2021Updated 5 years ago
- Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)☆33Sep 25, 2023Updated 2 years ago
- Personal Repo to keep track of RL papers☆31May 3, 2021Updated 4 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- ☆42May 11, 2022Updated 3 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆156Aug 31, 2021Updated 4 years ago
- A minimal Unreal Engine project for developing and testing UnrealCV☆17Nov 8, 2018Updated 7 years ago
- Implementation of BIMRL: Brain Inspired Meta Reinforcement Learning - Roozbeh Razavi et al. (IROS 2022)☆10Dec 1, 2022Updated 3 years ago
- Material complementar para o curso sobre Node-RED☆10Sep 5, 2019Updated 6 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆163Jul 17, 2020Updated 5 years ago
- Pytorch Implementation of ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation (https://arxiv.org/abs/1606.02147)☆11Jan 24, 2020Updated 6 years ago
- Explanation of Mathematics used in Machine Learning Algorithms and some Projects☆14Jul 19, 2018Updated 7 years ago
- ☆10Aug 26, 2022Updated 3 years ago
- Google AI Research☆10Mar 11, 2020Updated 5 years ago
- Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"☆23Sep 7, 2025Updated 5 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆14Jun 28, 2025Updated 7 months ago
- Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery☆16Jul 11, 2023Updated 2 years ago
- ☆14Jul 18, 2025Updated 6 months ago
- ☆10Jun 3, 2019Updated 6 years ago