nnaisense/MAGE

Learning Action-Value Gradients in Model-based Policy Optimization

☆32

Alternatives and similar repositories for MAGE

Users who are interested in MAGE are comparing it to the libraries listed below.

facebookresearch / svg
On the model-based stochastic value gradient for continuous reinforcement learning
☆58 · Mar 6, 2026 · Updated 4 months ago
evgenii-nikishin / omd
JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"
☆43 · Jun 14, 2021 · Updated 5 years ago
roosephu / boots
☆11 · Oct 14, 2019 · Updated 6 years ago
WilsonWangTHU / POPLIN
☆99 · Mar 24, 2023 · Updated 3 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 · Feb 21, 2020 · Updated 6 years ago
willwhitney / dynamics-aware-embeddings
Official implementation of DynE, Dynamics-aware Embeddings for RL
☆45 · Apr 28, 2021 · Updated 5 years ago
jannerm / mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆558 · Nov 22, 2022 · Updated 3 years ago
philipjball / ReadyPolicyOne
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18 · Jul 6, 2023 · Updated 3 years ago
nnaisense / MAX
Code for reproducing experiments in Model-Based Active Exploration, ICML 2019
☆81 · Jul 23, 2019 · Updated 7 years ago
thiagopbueno / tf-mdp
Probabilistic planning in continuous state-action MDPs in TensorFlow.
☆13 · Jun 21, 2022 · Updated 4 years ago
pairlab / vagram
[ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.
☆25 · Apr 15, 2023 · Updated 3 years ago
AutumnWu / Streamlined-Off-Policy-Learning
ICRL 2020
☆20 · Feb 18, 2020 · Updated 6 years ago
jxu43 / replication-mbpo
NeurIPS Reproducibility Challenge 2019
☆21 · Feb 25, 2020 · Updated 6 years ago
WilsonWangTHU / mbbl
☆399 · Jul 18, 2019 · Updated 7 years ago
jannerm / gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
☆48 · Sep 20, 2023 · Updated 2 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24 · Jun 24, 2020 · Updated 6 years ago
rems75 / SPIBB-DQN
Code for SPIBB-DQN and Soft-SPIBB-DQN
☆11 · May 5, 2020 · Updated 6 years ago
dbcbtc / RL-Papers
papers about reinforcement learning
☆13 · Jan 4, 2021 · Updated 5 years ago
dmksjfl / SEABO
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12 · Jan 19, 2024 · Updated 2 years ago
xuanlinli17 / iclr2021_rlreg
Regularization Matters in Policy Optimization
☆21 · Nov 1, 2021 · Updated 4 years ago
Xingyu-Lin / mbpo_pytorch
A PyTorch replication of the model-based reinforcement learning algorithm MBPO
☆189 · Apr 12, 2022 · Updated 4 years ago
jiangsy / mbpo_pytorch
☆30 · Mar 1, 2022 · Updated 4 years ago
Santara / stochastic_value_gradient
Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142)
☆25 · Jan 15, 2022 · Updated 4 years ago
mklissa / PPOC
Proximal Policy Option-Critic
☆26 · Jan 4, 2019 · Updated 7 years ago
jparkerholder / DvD_ES
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆46 · Oct 29, 2020 · Updated 5 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44 · Nov 15, 2018 · Updated 7 years ago
thiagopbueno / rddlgym
A toolkit for working with RDDL domains in Python3.
☆18 · Nov 7, 2020 · Updated 5 years ago
dmksjfl / PAR
Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)
☆15 · Aug 15, 2025 · Updated 11 months ago
ruizhaogit / mep
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)
☆24 · May 30, 2019 · Updated 7 years ago
ming93 / Safe_reinforcement_learning
Convergent Policy Optimization for Safe Reinforcement Learning
☆11 · Oct 26, 2019 · Updated 6 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆76 · Jun 9, 2021 · Updated 5 years ago
david-abel / rl_info_theory
A collection of code investigating the use of information theory for abstractions in RL
☆16 · Nov 14, 2018 · Updated 7 years ago
philipjball / OffCon3
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆25 · Jun 20, 2021 · Updated 5 years ago
tmoer / multimodal_varinf
Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".
☆35 · May 24, 2018 · Updated 8 years ago
ravi-lanka-4 / CoPiEr
Co-training for Policy Learning
☆13 · Aug 8, 2019 · Updated 6 years ago
Bellman-devs / bellman
Model-based reinforcement learning in TensorFlow
☆57 · Jul 27, 2021 · Updated 4 years ago
hari-sikchi / LOOP
Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]
☆42 · Aug 27, 2022 · Updated 3 years ago
acguez / bamcp
Bayes-Adaptive Monte-Carlo Planning algorithm
☆19 · Mar 5, 2013 · Updated 13 years ago
robintyh1 / icml2021-pengqlambda
Revisiting Peng's Q(lambda) for Modern Reinforcement Learning
☆15 · Jul 23, 2021 · Updated 5 years ago