[ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.
☆25Apr 15, 2023Updated 2 years ago
Alternatives and similar repositories for vagram
Users that are interested in vagram are comparing it to the libraries listed below
Sorting:
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆44Jun 14, 2021Updated 4 years ago
- ☆18Feb 7, 2021Updated 5 years ago
- ☆15Sep 14, 2020Updated 5 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- ☆15Updated this week
- Public Release of Plan2vec Implementation in pyTorch☆57Oct 28, 2022Updated 3 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Mar 9, 2023Updated 2 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆185Apr 12, 2022Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Nov 14, 2024Updated last year
- Implicit Differentiable Optimal Control (IDOC) with JAX☆12May 11, 2022Updated 3 years ago
- ☆99Mar 24, 2023Updated 2 years ago
- ☆11Oct 14, 2019Updated 6 years ago
- Neural Fixed-Point Acceleration for Convex Optimization☆29Oct 6, 2022Updated 3 years ago
- The Official Code for Offline Model-based Adaptable Policy Learning (NeurIPS'21 & TPAMI)☆25Jan 16, 2024Updated 2 years ago
- ☆14Jun 8, 2023Updated 2 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Jun 30, 2020Updated 5 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆81Jul 23, 2019Updated 6 years ago
- ☆13Mar 14, 2024Updated last year
- Library that provides environments for planning problems☆16Feb 21, 2026Updated last week
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Jul 16, 2023Updated 2 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Jan 7, 2026Updated last month
- Graph Learning with JAX☆14Jul 11, 2022Updated 3 years ago
- Ant Gather and Ant Maze envs, separated from RLLab☆11Aug 2, 2018Updated 7 years ago
- Code for "Learning Control-Oriented Dynamical Structure from Data" by Spencer M. Richards, Jean-Jacques Slotine, Navid Azizan, and Marco …☆16Oct 23, 2023Updated 2 years ago
- TD-Regularized Actor-Critic Methods☆36Dec 26, 2019Updated 6 years ago
- Repo for the paper "Landscape Surrogate Learning Decision Losses for Mathematical Optimization Under Partial Information"☆38Jul 20, 2023Updated 2 years ago
- Code release for "Stochastic Optimal Control Matching"☆39Aug 14, 2024Updated last year
- Model-based reinforcement learning using CEM, MPC and PETS☆16Nov 20, 2019Updated 6 years ago
- Windy GridWorlds environments compatible with OpenAI gym.☆15Jul 8, 2022Updated 3 years ago
- My CV☆39Updated this week
- Code the AAAI 2019 paper "Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization"☆35Feb 12, 2021Updated 5 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆531Nov 22, 2022Updated 3 years ago
- Robustness via Retrying: Closed-Loop Robotic Manipulation with Self-Supervised Learning☆16Nov 7, 2018Updated 7 years ago
- The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".☆17Jun 20, 2024Updated last year
- ☆20May 25, 2023Updated 2 years ago
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".☆35May 24, 2018Updated 7 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆41Aug 27, 2022Updated 3 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Oct 6, 2021Updated 4 years ago