Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Feb 21, 2020Updated 6 years ago
Alternatives and similar repositories for D3G
Users that are interested in D3G are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- I2Q: A Fully Decentralized Q-Learning Algorithm☆19Nov 10, 2022Updated 3 years ago
- ☆16Apr 14, 2026Updated 2 weeks ago
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>☆23Mar 24, 2023Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- code for the paper Offline Prioritized Experience Replay☆12Jun 13, 2023Updated 2 years ago
- ☆11Oct 3, 2022Updated 3 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Oct 28, 2022Updated 3 years ago
- RoboVat: A unified toolkit for simulated and real-world robotic task environments.☆67Nov 22, 2022Updated 3 years ago
- Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning☆12Jan 19, 2024Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆47Sep 20, 2023Updated 2 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting☆16Feb 14, 2024Updated 2 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆46Nov 22, 2022Updated 3 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- ☆32Jun 21, 2024Updated last year
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆62Apr 29, 2024Updated 2 years ago
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆543Nov 22, 2022Updated 3 years ago
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 6 years ago
- ☆22Nov 8, 2021Updated 4 years ago
- DrQ: Data regularized Q☆420Jan 13, 2023Updated 3 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆114Apr 16, 2026Updated 2 weeks ago
- ☆10Aug 17, 2022Updated 3 years ago
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆38Dec 30, 2024Updated last year
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆98Jun 19, 2025Updated 10 months ago
- Clockwork VAEs in JAX/Flax☆32Jul 16, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ICRL 2020☆20Feb 18, 2020Updated 6 years ago
- A simple, continuous-control environment for OpenAI Gym☆23Jan 1, 2023Updated 3 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Oct 22, 2019Updated 6 years ago
- ☆17Sep 28, 2023Updated 2 years ago
- ☆30Oct 3, 2023Updated 2 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆20Mar 10, 2021Updated 5 years ago