SanMumumu / FlowRAM
[2025CVPR] FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation
☆51Updated 2 months ago
Alternatives and similar repositories for FlowRAM
Users who are interested in FlowRAM are comparing it to the repositories listed below.
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …☆28Updated 6 months ago
- This is a project about visual spatial reasoning.☆89Updated last month
- [PG 2025] BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion☆56Updated 3 weeks ago
- TorchHook: A PyTorch hooks manager, providing convenient interfaces to capture feature maps and debug models.☆13Updated 4 months ago
- GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models☆486Updated 4 months ago
- EO: Open-source Unified Embodied Foundation Model Series☆290Updated 2 months ago
- (Preprint) ORV: 4D Occupancy-centric Robot Video Generation.☆77Updated 2 months ago
- vue3-elementPlus-admin,vue3-elementPlus-template☆59Updated 2 months ago
- [CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding☆249Updated 7 months ago
- ☆59Updated 7 months ago
- [NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving…☆596Updated 4 months ago
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation☆230Updated 6 months ago
- [ICRA 2026] A Unified Driving World Model for Future Generation and Perception☆136Updated this week
- A benchmark that evaluates LLMs' performance in automating drawing revision tasks.☆57Updated last month
- 🌐 3D and 4D World Modeling: A Survey☆806Updated 3 weeks ago
- Official implementation of [AstraNav-World: World Model for Foresight Control and Consistency]☆52Updated 2 weeks ago
- User interview platform☆24Updated 6 months ago
- ☆54Updated 2 months ago
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D☆203Updated last month
- [CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis☆181Updated last year
- Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking (CVPR 2025)☆81Updated 6 months ago
- [ICLR2026] Official implementation for "JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navig…☆447Updated 2 weeks ago
- Official implementation of "Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models".☆74Updated last month
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning☆258Updated 3 weeks ago
- Official implementation of "ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation"☆88Updated last month
- [ICCV 2025] Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation.☆51Updated 5 months ago
- ☆31Updated 5 months ago
- [ICCV2025] CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation☆20Updated 4 months ago
- 🔥 The first open-sourced diffusion vision-language-action model.☆160Updated last month
- [ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives☆229Updated last month