SanMumumu / FlowRAM
[2025CVPR] FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation
☆38Updated 2 weeks ago
Alternatives and similar repositories for FlowRAM
Users that are interested in FlowRAM are comparing it to the libraries listed below
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …☆28Updated 4 months ago
- TorchHook: A PyTorch hooks manager, providing convenient interfaces to capture feature maps and debug models.☆13Updated last month
- This is a project about visual spatial reasoning.☆79Updated last month
- ☆54Updated 4 months ago
- vue3-elementPlus-admin, vue3-elementPlus-template☆54Updated 2 weeks ago
- EO: Open-source Unified Embodied Foundation Model Series☆272Updated 2 weeks ago
- User interview platform☆22Updated 3 months ago
- (Preprint) ORV: 4D Occupancy-centric Robot Video Generation.☆70Updated this week
- A benchmark that evaluates LLMs' performance in automating drawing revision tasks.☆56Updated 3 months ago
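- This algorithm handles collision avoidance between a drone swarm and a drone joining the swarm. It has two approaches: (1) the joining drone plans its path and actively maneuvers to avoid the formation; (2) the formation makes fine adjustments to give way to the joining drone. Only the first approach has been implemented so far. The only known information is the original swarm's motion trajectory F(x,y,z,t)|each plane. For the first approach, the only input parameter for the gap-filling drone is…☆27Updated 3 months ago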
- [PG 2025] BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion☆51Updated last month
- 🔥 The first open-sourced diffusion vision-language-action model.☆86Updated this week
- [CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding☆236Updated 5 months ago
- GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models☆450Updated 2 months ago
- ☆31Updated 3 months ago
- [NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving…☆462Updated 2 months ago
- A Unified Driving World Model for Future Generation and Perception☆126Updated 4 months ago
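- 🚀 An enterprise-grade rapid development framework for admin and back-office applications, based on Vue3, TypeScript, and Vite, with a modular design and a rich set of built-in business components.☆23Updated last month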
- Quantify and analyze distribution shifts in learning from samples.☆35Updated last month
- [CVPR 2024] RankMatch: Exploring the Better Consistency Regularization for Semi-supervised Semantic Segmentation☆31Updated 9 months ago
- Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking (CVPR 2025)☆76Updated 4 months ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI☆103Updated this week
- [ICCV 2025] Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation.☆46Updated 3 months ago
- [CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation☆71Updated 2 months ago
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation☆192Updated 4 months ago
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model☆1,723Updated last week
- [ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives☆217Updated last month
- Pwn exploitation toolkit with a CLI for exp templates; provides Python APIs for Linux binex, scripts, etc.☆35Updated last month
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D☆164Updated 3 weeks ago
- [ACMMM 2025] Official implementation of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompti…☆210Updated 6 months ago