Flow RL is a high-performance RL library with flow and diffusion models.
☆28Mar 17, 2026Updated this week
Alternatives and similar repositories for flow-rl
Users that are interested in flow-rl are comparing it to the libraries listed below
Sorting:
- Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference☆31Nov 24, 2025Updated 3 months ago
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆21Mar 24, 2025Updated 11 months ago
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- Model-Based Uncertainty in Value Functions (AISTATS2023)☆16Feb 28, 2023Updated 3 years ago
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆36Oct 29, 2025Updated 4 months ago
- Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.☆35Oct 7, 2024Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆27Jul 19, 2023Updated 2 years ago
- A minimal implementation of Drifting Models for 2D toy data. Unlike diffusion/flow models that iterate at inference, drifting models evo…☆70Feb 13, 2026Updated last month
- ☆12Jan 25, 2026Updated last month
- [TMLR 2025 & ICLR 2025 DeLTa] Official Implementation of Design Editing for Offline Model-based Optimization 🧬 🤖☆10Apr 17, 2025Updated 11 months ago
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- ☆14Sep 29, 2025Updated 5 months ago
- Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"☆18Oct 5, 2024Updated last year
- The interface between probabilistic model checking and data-driven policy learning.☆16Mar 11, 2026Updated last week
- Official implementation of NeurIPS'24 Spotlight paper "Monte Carlo Tree Search based Space Transfer for Black-box Optimization".☆13Nov 28, 2024Updated last year
- An introduction to AI methods for flying agents (birds, UAVs, etc.)☆17Jul 28, 2023Updated 2 years ago
- ☆30Dec 23, 2025Updated 2 months ago
- ☆18Jul 8, 2025Updated 8 months ago
- ☆14Nov 2, 2022Updated 3 years ago
- A benchmark for offline goal-conditioned RL and offline RL☆346Jan 14, 2026Updated 2 months ago
- Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"☆32Apr 15, 2025Updated 11 months ago
- Baselines for Model-Based Optimization installation fixes and compatible with newer AMPERE+ GPUs (e.g. 3090)☆11Apr 30, 2023Updated 2 years ago
- P1: Mastering Physics Olympiads with Reinforcement Learning☆79Dec 29, 2025Updated 2 months ago
- Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning☆27Jul 4, 2025Updated 8 months ago
- A python module designed for agile RL algorithm developing.☆26Jul 11, 2024Updated last year
- Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…☆18Nov 8, 2024Updated last year
- ☆21Mar 19, 2024Updated 2 years ago
- Official implementation for ICLR 2025 paper "Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning"☆20Mar 5, 2025Updated last year
- Code for the paper "Bounce: Reliable High-Dimensional Bayesian Optimization for Combinatorial and Mixed Spaces"☆15Apr 30, 2024Updated last year
- Learning Safety Constraints for Large Language Models (ICML2025)☆32Aug 4, 2025Updated 7 months ago
- Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL☆37Feb 7, 2026Updated last month
- ☆15May 17, 2024Updated last year
- Standalone library of frequently-used wrappers for dm_env environments.☆19Jul 9, 2024Updated last year
- ☆10May 28, 2023Updated 2 years ago
- Non-official implementation of paper "In-context Reinforcement Learning with Algorithm Distillation"☆12Aug 15, 2024Updated last year
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆75Mar 22, 2024Updated last year
- ☆18Jul 17, 2019Updated 6 years ago
- Code for "A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences"☆16Feb 24, 2025Updated last year