IRMVLab / MADiff
MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos
☆18Updated 6 months ago
Alternatives and similar repositories for MADiff:
Users that are interested in MADiff are comparing it to the libraries listed below
- Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos.☆18Updated 3 weeks ago
- Driving Everywhere with Large Language Model Policy Adaptation☆14Updated 8 months ago
- RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments☆30Updated 2 weeks ago
- [CVPR 2025] CityWalker: Learning Embodied Urban Navigation from Web-Scale Videos☆66Updated last week
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆27Updated 8 months ago
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners☆43Updated this week
- Official codebase for PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors☆40Updated 6 months ago
- [ECCV 2024]Official PyTorch Implementation of HTCL : Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion☆38Updated 2 months ago
- ☆53Updated 11 months ago
- [ICLR'25] [3D-LLM] City-scale 3D Visual Grounding with Multi-modality LLMs☆36Updated this week
- [IROS 2023] SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion☆32Updated last year
- Project Page for GaussianFormer☆25Updated 10 months ago
- This repository is the latest model version corresponding to the paper 3DSFLabeling: Boosting 3D Scene Flow Estimation by Pseudo Auto Lab…☆32Updated 10 months ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆38Updated 2 months ago
- [ECCV'24] SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving☆88Updated 2 weeks ago
- Official implement for paper "OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries"☆99Updated last year
- GSPR: Multimodal Place Recognition using 3D Gaussian Splatting for Autonomous Driving☆38Updated 6 months ago
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆54Updated 3 months ago
- [CVPR 2025] RelationField: Relate Anything in Radiance Fields☆41Updated last week
- Mask4Former: Mask Transformer for 4D Panoptic Segmentation☆54Updated 10 months ago
- [CVPR 2025] MAC-Ego3D: Multi-Agent Gaussian Consensus for Real-Time Collaborative Ego-Motion and Photorealistic 3D Reconstruction☆33Updated this week
- [ICCV 2023] DELFlow: Dense Efficient Learning of Scene Flow for Large-Scale Point Clouds☆13Updated 11 months ago
- Implementation of IROS21 paper - "Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds"☆29Updated last year
- [ICLR2025] OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework☆32Updated last month
- ☆44Updated 2 months ago
- ☆86Updated 2 months ago
- [NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)☆109Updated 2 months ago
- [CVPR 2025] Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting☆16Updated 3 weeks ago
- [3DV 2024] Repository for "Multi-Body Neural Scene Flow", in International Conference on 3D Vision 2024.☆13Updated last year
- [CVPR2024] Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset☆50Updated 9 months ago