CMU-AIRe / floqLinks
Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL
☆25Updated 2 months ago
Alternatives and similar repositories for floq
Users that are interested in floq are comparing it to the libraries listed below
Sorting:
- PWM: Policy Learning with Large World Models☆65Updated 5 months ago
- Official implementation of DEMO3☆66Updated 5 months ago
- ☆38Updated 4 months ago
- DrM, a visual RL algorithm, minimizes the dormant ratio to guide exploration-exploitation trade-offs, achieving significant improvements …☆77Updated last year
- The official implementation of Value Flows☆38Updated 2 months ago
- The official implementation of "Horizon Reduction Makes RL Scalable"☆179Updated 5 months ago
- (ICLR 2024) Reverse Forward Curriculum Learning☆51Updated last year
- ☆116Updated 10 months ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆132Updated last year
- ☆75Updated this week
- ☆31Updated last year
- ☆29Updated 2 years ago
- Code for SAPG: Split and Aggregate Policy Gradients (ICML 2024)☆59Updated last year
- JAX implementation of WSRL and RL baselines | ICLR 2025☆125Updated 6 months ago
- ☆35Updated 7 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆92Updated last year
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆26Updated last year
- Skeleton for scalable and flexible Jax RL implementations☆93Updated 2 years ago
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆33Updated 3 months ago
- Jax/Flax Implementation of TD-MPC2☆70Updated this week
- ☆77Updated 7 months ago
- Official release of the DMControl Generalization Benchmark 2 (DMC-GB2)☆22Updated 5 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆112Updated last year
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆35Updated last year
- The official implementations of Intention-conditioned Flow Occupancy Models (InFOM)☆29Updated last week
- Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"☆74Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆80Updated 2 years ago
- [ICLR 2025] Bootstrapped Model Predictive Control☆30Updated 5 months ago
- ☆30Updated 11 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆104Updated 3 months ago