Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 · Feb 21, 2020 · Updated 6 years ago
Alternatives and similar repositories for D3G
Users that are interested in D3G are comparing it to the libraries listed below.
- I2Q: A Fully Decentralized Q-Learning Algorithm ☆19 · Nov 10, 2022 · Updated 3 years ago
- ☆15 · Jan 18, 2026 · Updated 2 months ago
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization> ☆23 · Mar 24, 2023 · Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid ☆12 · Jun 19, 2019 · Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆93 · Sep 13, 2019 · Updated 6 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019. ☆16 · Sep 24, 2019 · Updated 6 years ago
- code for the paper Offline Prioritized Experience Replay ☆12 · Jun 13, 2023 · Updated 2 years ago
- ☆11 · Oct 3, 2022 · Updated 3 years ago
- Public Release of Plan2vec Implementation in pyTorch ☆57 · Oct 28, 2022 · Updated 3 years ago
- RoboVat: A unified toolkit for simulated and real-world robotic task environments. ☆67 · Nov 22, 2022 · Updated 3 years ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning ☆12 · Jan 19, 2024 · Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆48 · Sep 20, 2023 · Updated 2 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition" ☆19 · Mar 17, 2022 · Updated 4 years ago
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting ☆16 · Feb 14, 2024 · Updated 2 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 · Dec 8, 2022 · Updated 3 years ago
- Model-Free-Episodic-Control implementation. ☆17 · Jun 3, 2019 · Updated 6 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors" ☆46 · Nov 22, 2022 · Updated 3 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979… ☆15 · Mar 9, 2022 · Updated 4 years ago
- ☆15 · Oct 20, 2020 · Updated 5 years ago
- ☆32 · Jun 21, 2024 · Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022) ☆61 · Apr 29, 2024 · Updated last year
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback ☆12 · Jul 13, 2022 · Updated 3 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆537 · Nov 22, 2022 · Updated 3 years ago
- Implementation of Random Expert Distillation ☆29 · May 11, 2019 · Updated 6 years ago
- ☆22 · Nov 8, 2021 · Updated 4 years ago
- DrQ: Data regularized Q ☆419 · Jan 13, 2023 · Updated 3 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆112 · May 27, 2024 · Updated last year
- ☆10 · Aug 17, 2022 · Updated 3 years ago
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning ☆36 · Dec 30, 2024 · Updated last year
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas. ☆97 · Jun 19, 2025 · Updated 9 months ago
- Clockwork VAEs in JAX/Flax ☆32 · Jul 16, 2021 · Updated 4 years ago
- ICRL 2020 ☆20 · Feb 18, 2020 · Updated 6 years ago
- A simple, continuous-control environment for OpenAI Gym ☆23 · Jan 1, 2023 · Updated 3 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583 ☆19 · Oct 22, 2019 · Updated 6 years ago
- ☆17 · Sep 28, 2023 · Updated 2 years ago
- ☆29 · Oct 3, 2023 · Updated 2 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies ☆19 · Mar 10, 2021 · Updated 5 years ago
- ☆14 · Jun 26, 2019 · Updated 6 years ago