tachukao / idocLinks
Implicit Differentiable Optimal Control (IDOC) with JAX
☆12Updated 3 years ago
Alternatives and similar repositories for idoc
Users that are interested in idoc are comparing it to the libraries listed below
Sorting:
- Performant, differentiable reinforcement learning☆24Updated 2 years ago
- a High-Performance Distributed Solver for Large-Scale Markov Decision Processes (MDP) relying on Inexact Policy Iteration; for Python and…☆26Updated 6 months ago
- Online variational GPs☆37Updated 2 years ago
- ☆43Updated 2 years ago
- A PyTorch library for all things nonlinear control and reinforcement learning.☆47Updated 3 years ago
- ☆35Updated 2 years ago
- Reinforcement Learning inside a 3D soccer simulation☆31Updated last year
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆45Updated 2 years ago
- Neural Fixed-Point Acceleration for Convex Optimization☆29Updated 3 years ago
- Model-based reinforcement learning in TensorFlow☆56Updated 4 years ago
- Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 2 years ago
- A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.☆17Updated 2 years ago
- A toolbox for inference of mixture models☆16Updated 2 years ago
- a little library to help me with things involving Koopman operators☆12Updated 3 years ago
- ☆43Updated 4 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Updated 2 years ago
- ☆27Updated 4 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Updated 2 years ago
- This repository contains the source code to perform Geometry-aware Bayesian Optimization (GaBO) on Riemannian manifolds.☆53Updated 3 years ago
- Repository for "Quality-Diversity Actor-Critic: Learning High-Performing and Diverse Behaviors via Value and Successor Features Critics" …☆17Updated last year
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Updated 5 years ago
- Cross-Domain Imitation Learning via Optimal Transport☆25Updated 3 years ago
- Contact-Aware Symplectic Integrator Network☆15Updated 2 years ago
- improved Cross Entropy Method for trajectory optimization☆80Updated 4 years ago
- ☆14Updated 2 years ago
- ☆35Updated 3 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated 2 years ago
- By introducing a differentiable contact model, DiffCoSim extends the applicability of Lagrangian/Hamiltonian-inspired neural networks to …☆36Updated 2 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated 2 years ago
- GPT implementation in Flax☆18Updated 3 years ago