tachukao / idoc
Implicit Differentiable Optimal Control (IDOC) with JAX
☆12Updated 2 years ago
Alternatives and similar repositories for idoc:
Users that are interested in idoc are comparing it to the libraries listed below
- Performant, differentiable reinforcement learning☆25Updated last year
- A toolbox for inference of mixture models☆16Updated last year
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Updated 4 years ago
- A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.☆15Updated 2 years ago
- a High-Performance Distributed Solver for Large-Scale Markov Decision Processes (MDP) relying on Inexact Policy Iteration; for Python and…☆25Updated 2 weeks ago
- Generalised UDRL☆37Updated 2 years ago
- Companion code in JAX for the paper Parallel Iterated Extended and Sigma-Point Kalman Smoothers.☆27Updated 8 months ago
- Google AI Princeton control framework☆38Updated 4 years ago
- ☆27Updated 4 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated last year
- Online variational GPs☆32Updated last year
- Variational Reinforcement Learning☆16Updated 9 months ago
- Tree-structured recurrent switching linear dynamical systems☆36Updated 4 years ago
- JAX workshop for beginners☆8Updated 3 months ago
- Flexible Inference for Predictive Coding Networks in JAX.☆32Updated 2 weeks ago
- Cross-Domain Imitation Learning via Optimal Transport☆24Updated 2 years ago
- ☆36Updated 2 years ago
- Contact-Aware Symplectic Integrator Network☆13Updated 2 years ago
- My PhD thesis. I defended on the 30th of October, 2020! See https://github.com/eleurent/phd-defense/☆14Updated 3 years ago
- Python bindings to some optimization benchmarks (robotics problems), in order to constrained optimization solvers. Includes also an inter…☆21Updated 2 years ago
- Model-based reinforcement learning in TensorFlow☆55Updated 3 years ago
- This is a collection of code samples aimed at illustrating temporal parallelization methods for sequential data.☆31Updated last year
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Updated 6 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Updated 2 years ago
- Neural Fixed-Point Acceleration for Convex Optimization☆29Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆37Updated 2 years ago
- ☆16Updated 4 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆33Updated 2 years ago