[ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.
☆25Apr 15, 2023Updated 3 years ago
Alternatives and similar repositories for vagram
Users that are interested in vagram are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Jun 14, 2021Updated 4 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- ☆15Sep 14, 2020Updated 5 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆188Apr 12, 2022Updated 4 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Mar 9, 2023Updated 3 years ago
- Official Code for "Relative Entropy Pathwise Policy Optimization"☆53May 6, 2026Updated 2 weeks ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆28May 22, 2023Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆182Nov 14, 2024Updated last year
- ☆18May 25, 2023Updated 2 years ago
- Public Release of Plan2vec Implementation in pyTorch☆57Oct 28, 2022Updated 3 years ago
- ☆11Oct 14, 2019Updated 6 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Jun 30, 2020Updated 5 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆544Nov 22, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Windy GridWorlds environments compatible with OpenAI gym.☆15Jul 8, 2022Updated 3 years ago
- ☆99Mar 24, 2023Updated 3 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆81Jul 23, 2019Updated 6 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"☆18Oct 21, 2022Updated 3 years ago
- Neural Fixed-Point Acceleration for Convex Optimization☆30Oct 6, 2022Updated 3 years ago
- Library that provides environments for planning problems☆16Apr 24, 2026Updated 3 weeks ago
- ☆17May 14, 2026Updated last week
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Mar 6, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Jul 16, 2023Updated 2 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆94Jul 25, 2024Updated last year
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".☆35May 24, 2018Updated 7 years ago
- Gantry provides an API that streamlines running experiments in Beaker☆33Apr 8, 2026Updated last month
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- ☆14Jun 8, 2023Updated 2 years ago
- ☆12Mar 14, 2024Updated 2 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆86Jul 27, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Ant Gather and Ant Maze envs, separated from RLLab☆11Aug 2, 2018Updated 7 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆46Sep 20, 2023Updated 2 years ago
- Model-based reinforcement learning in TensorFlow☆56Jul 27, 2021Updated 4 years ago
- Graph Learning with JAX☆14Jul 11, 2022Updated 3 years ago
- Library for Model Based RL☆1,060Jul 12, 2024Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆190May 17, 2022Updated 4 years ago
- Single-Life Reinforcement Learning☆14Dec 17, 2022Updated 3 years ago