Official code release for ICLR23 "Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning"
☆16Mar 8, 2023Updated 3 years ago
Alternatives and similar repositories for value_expansion
Users that are interested in value_expansion are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Model Tensor Planning in JAX, TMLR 2025 & ICLR 2026.☆29Jun 5, 2025Updated 11 months ago
- Implementation of different Relative Entropy Policy Search flavors☆13Nov 15, 2021Updated 4 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- ☆10Sep 19, 2023Updated 2 years ago
- A workflow showing how to combine Inkscape and LaTeX to obtain figures that match your document better☆29May 9, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Implementation of Sinkhorn Step in JAX, NeurIPS 2023.☆49Jan 29, 2026Updated 3 months ago
- Benchmarking suite for MushroomRL Deep RL algorithms☆16Feb 2, 2024Updated 2 years ago
- Flatland Multi Agent Reinforcement Learning☆16Aug 1, 2020Updated 5 years ago
- A library containing a collection of distance and similarity measures for data analysis☆16Mar 25, 2026Updated last month
- Simple change of a3c to a2c☆15Jun 18, 2017Updated 8 years ago
- ☆14Updated this week
- Implementation of Receding Horizon Curiosity Algrithm☆13Mar 24, 2023Updated 3 years ago
- Open source Java framework to create, process and manage mixtures of exponential family☆14Aug 4, 2015Updated 10 years ago
- A super-lightweight super-capable agentic tool with improved security versus OpenClaw.☆46Apr 28, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the paper "Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning"☆16Jul 4, 2022Updated 3 years ago
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning☆11Jun 16, 2022Updated 3 years ago
- reinforcement learning from randomized simulations☆68Mar 31, 2025Updated last year
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms☆583Feb 25, 2026Updated 2 months ago
- ☆23Jun 8, 2021Updated 4 years ago
- PyTorch implementation of DreamerV3 from "Mastering Diverse Domains with World Models"☆16Aug 8, 2025Updated 8 months ago
- Open source code for paper "Learning World Models with Identifiable Factorization"☆13Mar 4, 2024Updated 2 years ago
- ☆37Jun 25, 2024Updated last year
- A Python package to simulate, solve and visualize the source-tracking POMDP☆20Jan 10, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Program that calculates design parameters for an ornithopter in level flight☆13Aug 11, 2023Updated 2 years ago
- ☆24May 12, 2025Updated 11 months ago
- A generic library for linear and non-linear Gaussian smoothing problems. The code leverages JAX and implements several linearization algo…☆13Apr 20, 2026Updated 2 weeks ago
- Lightweight Isaac Gym Environment Builder☆40Nov 30, 2022Updated 3 years ago
- Variational Inference by Policy Search☆13Apr 24, 2019Updated 7 years ago
- A refined approach to add gifs to a dash app☆20Jul 7, 2020Updated 5 years ago
- ☆23Jan 15, 2026Updated 3 months ago
- ☆13Aug 9, 2022Updated 3 years ago
- ☆25Jul 10, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- implicit behaviour cloning toy 2d example☆14Oct 8, 2021Updated 4 years ago
- A toolbox for inference of switching systems for control☆11Aug 23, 2021Updated 4 years ago
- ☆10Sep 6, 2019Updated 6 years ago
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL☆26Oct 27, 2024Updated last year
- This repository contains the source code of the paper "Learning Accurate and Interpretable Decision Rule Sets from Neural Networks".☆16Jan 10, 2022Updated 4 years ago
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆22Jul 14, 2024Updated last year
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆23Nov 29, 2025Updated 5 months ago