uber-research/D3G

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/uber-research/D3G)

uber-research / D3G

Estimating Q(s,s') with Deep Deterministic Dynamics Gradients

☆32

Alternatives and similar repositories for D3G

Users that are interested in D3G are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jiechuanjiang / I2Q
View on GitHub
I2Q: A Fully Decentralized Q-Learning Algorithm
☆19Nov 10, 2022Updated 3 years ago
apexrl / bmpo
View on GitHub
Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>
☆23Mar 24, 2023Updated 3 years ago
pcchenxi / LAPO-offlienRL
View on GitHub
☆16Apr 14, 2026Updated 3 months ago
nnaisense / MAGE
View on GitHub
Learning Action-Value Gradients in Model-based Policy Optimization
☆32Sep 7, 2021Updated 4 years ago
bonniesjli / DQN_SR
View on GitHub
Count based exploration with the successor representation for Unity ML's Pyramid
☆12Jun 19, 2019Updated 7 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
facebookresearch / slbo
View on GitHub
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆94Sep 13, 2019Updated 6 years ago
suyoung-lee / Episodic-Backward-Update
View on GitHub
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆16Sep 24, 2019Updated 6 years ago
geyang / plan2vec
View on GitHub
Public Release of Plan2vec Implementation in pyTorch
☆57Oct 28, 2022Updated 3 years ago
sail-sg / OPER
View on GitHub
code for the paper Offline Prioritized Experience Replay
☆12Jun 13, 2023Updated 3 years ago
tung-nd / cwbc
View on GitHub
☆11Oct 3, 2022Updated 3 years ago
StanfordVL / robovat
View on GitHub
RoboVat: A unified toolkit for simulated and real-world robotic task environments.
☆67Nov 22, 2022Updated 3 years ago
dmksjfl / SEABO
View on GitHub
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12Jan 19, 2024Updated 2 years ago
jannerm / gamma-models
View on GitHub
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
☆48Sep 20, 2023Updated 2 years ago
Stilwell-Git / Randomized-Return-Decomposition
View on GitHub
TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"
☆19Mar 17, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
astier / model-free-episodic-control
View on GitHub
Model-Free-Episodic-Control implementation.
☆17Jun 3, 2019Updated 7 years ago
orybkin / video-gcp
View on GitHub
Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"
☆46Nov 22, 2022Updated 3 years ago
google-deepmind / dmc_vision_benchmark
View on GitHub
☆34Jun 21, 2024Updated 2 years ago
Improbable-AI / harness-offline-rl
View on GitHub
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16Feb 14, 2024Updated 2 years ago
RockySJ / ampo
View on GitHub
☆15Oct 20, 2020Updated 5 years ago
McGill-NLP / feedbackqa
View on GitHub
FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback
☆12Jul 13, 2022Updated 4 years ago
dmksjfl / MCQ
View on GitHub
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆64Apr 29, 2024Updated 2 years ago
jannerm / mbpo
View on GitHub
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆558Nov 22, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
YiqinYang / VEM
View on GitHub
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15Mar 9, 2022Updated 4 years ago
RuohanW / RED
View on GitHub
Implementation of Random Expert Distillation
☆29May 11, 2019Updated 7 years ago
denisyarats / drq
View on GitHub
DrQ: Data regularized Q
☆422Jan 13, 2023Updated 3 years ago
google-deepmind / affordances_option_models
View on GitHub
☆22Nov 8, 2021Updated 4 years ago
LAMDA-RL / ImagineBench
View on GitHub
A benchmark for evaluating reinforcement learning algorithms that train the policies using imaginary rollouts from LLMs.
☆15Nov 4, 2025Updated 8 months ago
conglu1997 / v-d4rl
View on GitHub
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆115Apr 16, 2026Updated 3 months ago
seungyulhan / disc
View on GitHub
☆10Aug 17, 2022Updated 3 years ago
qlan3 / Explorer
View on GitHub
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆98Updated this week
AutumnWu / Streamlined-Off-Policy-Learning
View on GitHub
ICRL 2020
☆20Feb 18, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
supratikp / HOOF
View on GitHub
Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583
☆19Oct 22, 2019Updated 6 years ago
0xangelo / gym-cartpole-swingup
View on GitHub
A simple, continuous-control environment for OpenAI Gym
☆23Jan 1, 2023Updated 3 years ago
charleshsc / QT
View on GitHub
ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning
☆38Dec 30, 2024Updated last year
d5rlbenchmark / d5rl
View on GitHub
☆31Oct 3, 2023Updated 2 years ago
YyzHarry / SV-RL
View on GitHub
[ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning
☆34Feb 1, 2020Updated 6 years ago
uber-research / Evolvability-ES
View on GitHub
☆14Jun 26, 2019Updated 7 years ago
MishaLaskin / curl
View on GitHub
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
☆605Oct 28, 2020Updated 5 years ago