4DVLab / Freqpolicy
[NeurIPS 2025] FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens
☆20 · Updated 3 months ago
Alternatives and similar repositories for Freqpolicy
Users who are interested in Freqpolicy are comparing it to the repositories listed below.
- [ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping ☆33 · Updated 2 months ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer ☆28 · Updated 3 months ago
- The official repo for the paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025) ☆108 · Updated 2 months ago
- VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning ☆124 · Updated 4 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆173 · Updated 7 months ago
- Official repository for "Vid2World: Crafting Video Diffusion Models to Interactive World Models" (ICLR 2026), https://arxiv.org/abs/2505.… ☆35 · Updated last week
- The official implementation of "DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation". (arXiv 2601.22153) ☆118 · Updated last week
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge ☆286 · Updated last month
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning ☆43 · Updated last year
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding. ☆47 · Updated 4 months ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning" ☆126 · Updated 3 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning ☆55 · Updated 10 months ago
- Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation" ☆121 · Updated 5 months ago
- Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics" ☆57 · Updated 2 weeks ago
- [ICLR 2026] Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation" ☆72 · Updated 3 weeks ago
- ☆54 · Updated last year
- ☆65 · Updated 2 months ago
- [NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding ☆70 · Updated 4 months ago
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model ☆175 · Updated last month
- Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models ☆146 · Updated last month
- VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs ☆46 · Updated last month
- Official PyTorch implementation for ICML 2025 paper: UP-VLA. ☆55 · Updated 2 weeks ago