Official code release for ICLR23 "Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning"
☆16Mar 8, 2023Updated 3 years ago
Alternatives and similar repositories for value_expansion
Users that are interested in value_expansion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of different Relative Entropy Policy Search flavors☆13Nov 15, 2021Updated 4 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- A workflow showing how to combine Inkscape and LaTeX to obtain figures that match your document better☆29May 9, 2021Updated 4 years ago
- Implementation of Sinkhorn Step in JAX, NeurIPS 2023.☆49Jan 29, 2026Updated 2 months ago
- Benchmarking suite for MushroomRL Deep RL algorithms☆16Feb 2, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆14Mar 31, 2026Updated 2 weeks ago
- Content for YouTube videos☆11Mar 16, 2026Updated last month
- Open source Java framework to create, process and manage mixtures of exponential family☆14Aug 4, 2015Updated 10 years ago
- A super-lightweight super-capable agentic tool with improved security versus OpenClaw.☆39Apr 9, 2026Updated last week
- Code for the paper "Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning"☆16Jul 4, 2022Updated 3 years ago
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning☆11Jun 16, 2022Updated 3 years ago
- reinforcement learning from randomized simulations☆68Mar 31, 2025Updated last year
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms☆577Feb 25, 2026Updated last month
- Official code for "Knowledge intensive state design for traffic signal control"☆30Feb 28, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An OpenAI Gym environment for Pokemon battles☆11Sep 3, 2019Updated 6 years ago
- ☆23Jun 8, 2021Updated 4 years ago
- Open source code for paper "Learning World Models with Identifiable Factorization"☆13Mar 4, 2024Updated 2 years ago
- ☆38Jun 25, 2024Updated last year
- A Python package to simulate, solve and visualize the source-tracking POMDP☆20Jan 10, 2025Updated last year
- Program that calculates design parameters for an ornithopter in level flight☆13Aug 11, 2023Updated 2 years ago
- ☆20Sep 16, 2024Updated last year
- ☆22May 12, 2025Updated 11 months ago
- A generic library for linear and non-linear Gaussian smoothing problems. The code leverages JAX and implements several linearization algo…☆13Dec 4, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Lightweight Isaac Gym Environment Builder☆40Nov 30, 2022Updated 3 years ago
- Variational Inference by Policy Search☆13Apr 24, 2019Updated 6 years ago
- A refined approach to add gifs to a dash app☆20Jul 7, 2020Updated 5 years ago
- ☆23Jan 15, 2026Updated 3 months ago
- arXiv? No. ChineseXiv.☆115Mar 24, 2026Updated 3 weeks ago
- ☆13Aug 9, 2022Updated 3 years ago
- ☆25Jul 10, 2023Updated 2 years ago
- implicit behaviour cloning toy 2d example☆14Oct 8, 2021Updated 4 years ago
- A toolbox for inference of switching systems for control☆11Aug 23, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆10Sep 6, 2019Updated 6 years ago
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL☆25Oct 27, 2024Updated last year
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆22Nov 29, 2025Updated 4 months ago
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆21Jul 14, 2024Updated last year
- ☆15Apr 12, 2023Updated 3 years ago
- ☆21Mar 19, 2024Updated 2 years ago
- Open-source codebase for PaMoRL, from "Parallelizing Model-based Reinforcement Learning Over the Sequence Length" at NeurIPS 2024.☆14Dec 17, 2024Updated last year