SanMumumu / FlowRAM
[CVPR 2025] FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation
☆51Updated 2 months ago
Alternatives and similar repositories for FlowRAM
Users who are interested in FlowRAM are comparing it to the libraries listed below.
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …☆28Updated 6 months ago
- This is a project about visual spatial reasoning.☆89Updated last month
- [CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding☆249Updated 7 months ago
- [PG 2025] BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion☆56Updated 3 weeks ago
- TorchHook: A PyTorch hooks manager, providing convenient interfaces to capture feature maps and debug models.☆13Updated 4 months ago
- (Preprint) ORV: 4D Occupancy-centric Robot Video Generation.☆77Updated 2 months ago
- vue3-elementPlus-admin, vue3-elementPlus-template☆59Updated 2 months ago
- GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models☆486Updated 4 months ago
- EO: Open-source Unified Embodied Foundation Model Series☆290Updated 2 months ago
- [NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving…☆596Updated 4 months ago
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation☆230Updated 6 months ago
- ☆59Updated 7 months ago
- [ICRA 2026] A Unified Driving World Model for Future Generation and Perception☆136Updated this week
- Official implementation of [AstraNav-World: World Model for Foresight Control and Consistency]☆52Updated 2 weeks ago
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D☆203Updated last month
- [ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives☆229Updated last month
- 🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future☆286Updated 3 weeks ago
- 🌐 3D and 4D World Modeling: A Survey☆793Updated 3 weeks ago
- [CVPR 2025] UniScene: Unified Occupancy-centric Driving Scene Generation☆550Updated 3 weeks ago
- User interview platform☆24Updated 6 months ago
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆272Updated 3 months ago
- [CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis☆181Updated last year
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning☆258Updated 3 weeks ago
- [ACMMM 2025] Official implementation of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompti…☆217Updated 9 months ago
- ☆54Updated 2 months ago
- [AAAI 2026 Oral] LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences☆184Updated last month
- [CoRLW 2025 (Oral), IASEAI 2026] Implementation for "Challenger: Affordable Adversarial Driving Video Generation"☆139Updated last month
- Official implementation of "ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation"☆88Updated last month
- A comprehensive and critical synthesis of the emerging role of GenAI across the full autonomous driving stack☆229Updated 4 months ago
- DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving (ICLR 2026)☆297Updated 2 weeks ago