SanMumumu / FlowRAM
[2025CVPR] FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation
☆46 Updated 2 months ago
Alternatives and similar repositories for FlowRAM
Users that are interested in FlowRAM are comparing it to the libraries listed below
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With … ☆28 Updated 5 months ago
- [PG 2025] BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion ☆56 Updated last week
- [CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding ☆246 Updated 7 months ago
- This is a project about visual spatial reasoning. ☆82 Updated last month
- (Preprint) ORV: 4D Occupancy-centric Robot Video Generation. ☆76 Updated last month
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation ☆210 Updated 5 months ago
- GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models ☆472 Updated 3 months ago
- A Unified Driving World Model for Future Generation and Perception ☆132 Updated 5 months ago
- TorchHook: A PyTorch hooks manager, providing convenient interfaces to capture feature maps and debug models. ☆13 Updated 3 months ago
- [ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives ☆227 Updated last month
- ☆50 Updated last month
- 🌐 Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future ☆222 Updated this week
- A comprehensive and critical synthesis of the emerging role of GenAI across the full autonomous driving stack ☆224 Updated 3 months ago
- [AAAI 2026] WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving ☆18 Updated 2 weeks ago
- [CVPR 2025] UniScene: Unified Occupancy-centric Driving Scene Generation ☆540 Updated 2 months ago
- [CoRLW 2025 (Oral), IASEAI 2026] Implementation for "Challenger: Affordable Adversarial Driving Video Generation" ☆134 Updated last week
- ☆273 Updated 2 months ago
- A benchmark that evaluates LLMs' performance in automating drawing revision tasks. ☆56 Updated 2 weeks ago
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D ☆197 Updated 2 weeks ago
- This is the official implementation of UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving ☆190 Updated 4 months ago
- EO: Open-source Unified Embodied Foundation Model Series ☆280 Updated 2 months ago
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning ☆242 Updated last month
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving ☆266 Updated 2 months ago
- [ICCV 2025] DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation ☆179 Updated 4 months ago
- ☆58 Updated 6 months ago
- [NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving… ☆560 Updated 3 months ago
- 🌐 3D and 4D World Modeling: A Survey ☆755 Updated 3 weeks ago
- Official implementation of "ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation" ☆79 Updated 2 weeks ago
- vue3-elementPlus-admin, vue3-elementPlus-template ☆59 Updated 2 months ago
- [ICCV 2025] Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation. ☆50 Updated 4 months ago