PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020
☆45 · Oct 25, 2020 · Updated 5 years ago
Alternatives and similar repositories for mbrl-smdp-ode
Users that are interested in mbrl-smdp-ode are comparing it to the libraries listed below.
- Neural Ordinary Differential Equations for Reinforcement Learning ☆25 · Jul 6, 2023 · Updated 2 years ago
- Official repository for the paper "Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules" (… ☆24 · Jun 11, 2025 · Updated last year
- DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control ☆23 · Sep 14, 2020 · Updated 5 years ago
- ☆15 · Apr 5, 2023 · Updated 3 years ago
- [NeurIPS'23] ODE-based Recurrent Model-free Reinforcement Learning for POMDPs ☆18 · May 3, 2025 · Updated last year
- ☆14 · Aug 9, 2020 · Updated 5 years ago
- ☆12 · May 5, 2023 · Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago
- ☆15 · Aug 4, 2020 · Updated 5 years ago
- Implementation of the rough volatility model and its calibration ☆10 · Jul 11, 2020 · Updated 5 years ago
- Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning". ☆14 · May 23, 2021 · Updated 5 years ago
- ☆19 · Nov 25, 2022 · Updated 3 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining" ☆30 · Sep 24, 2019 · Updated 6 years ago
- Neural Laplace Control for Continuous-time Delayed Systems - an offline RL method combining Neural Laplace dynamics model and MPC planner… ☆17 · Apr 26, 2023 · Updated 3 years ago
- ☆10 · Sep 9, 2022 · Updated 3 years ago
- ☆14 · May 31, 2022 · Updated 4 years ago
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning" ☆11 · Oct 3, 2023 · Updated 2 years ago
- Generating global explanations from local ones ☆11 · Nov 11, 2022 · Updated 3 years ago
- Bayer, Friz, Gassiat, Martin, Stemper (2017). A regularity structure for finance. ☆12 · Sep 29, 2017 · Updated 8 years ago
- Scaling Population-Based Reinforcement Learning with GPU Accelerated Simulation ☆13 · Nov 5, 2025 · Updated 7 months ago
- Distributed & asynchronous DQN implementation using gRPC and PyTorch. ☆10 · Feb 15, 2021 · Updated 5 years ago
- ☆72 · Jun 20, 2022 · Updated 4 years ago
- Neural Fictitious Self-Play in Leduc Holdem ☆11 · Jul 4, 2018 · Updated 8 years ago
- Model-Based Visual Planning with Self-Supervised Functional Distances (ICLR 2021) ☆20 · Jul 31, 2021 · Updated 4 years ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch ☆27 · Feb 11, 2025 · Updated last year
- Analytical solution and calibration ☆14 · Aug 1, 2011 · Updated 14 years ago
- Standalone library of frequently-used wrappers for dm_env environments. ☆19 · Jul 9, 2024 · Updated last year
- ☆14 · Jun 7, 2024 · Updated 2 years ago
- Unofficial implementation for the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints'[https://arxiv.org/ab… ☆12 · Aug 21, 2022 · Updated 3 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation) ☆71 · Jul 17, 2021 · Updated 4 years ago
- ☆13 · Jul 3, 2023 · Updated 3 years ago
- Official codebase for Exact Energy-Guided Diffusion Sampling via Contrastive Energy Prediction ☆35 · Nov 3, 2023 · Updated 2 years ago
- Get the URL for a YouTube video's subtitles. ☆10 · Mar 23, 2018 · Updated 8 years ago
- Build custom model types for estimation. ☆12 · Updated this week
- ☆14 · Apr 8, 2021 · Updated 5 years ago
- Implementation for Phenotype prediction from single-cell RNA-seq data using attention-based neural networks (Bioinformatics 2024). ☆13 · Jul 15, 2024 · Updated last year
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆116 · Dec 5, 2023 · Updated 2 years ago
- Official code for "Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Trans… ☆10 · Sep 11, 2024 · Updated last year
- Translated notes from Matlab to Python for Dave Backus's Macrofoundations class. ☆15 · Oct 11, 2017 · Updated 8 years ago