[CVPR 2025] Official codes for the paper 'Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models'
☆35Apr 8, 2025Updated 10 months ago
Alternatives and similar repositories for Mamba4D
Users that are interested in Mamba4D are comparing it to the libraries listed below
Sorting:
- ☆12Aug 5, 2022Updated 3 years ago
- Code for paper: [IEEE T-IV 2024] LXL: LiDAR Excluded Lean 3D Object Detection With 4D Imaging Radar and Camera Fusion☆23Jan 7, 2026Updated last month
- [CVPR'25] SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction☆20Jul 28, 2025Updated 7 months ago
- The code for the paper "LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling" (NeurIPS'24).☆13Dec 25, 2024Updated last year
- [ICCV2025] All in One: Visual-Description-Guided Unified Point Cloud Segmentation☆28Jul 25, 2025Updated 7 months ago
- This is the repository for the 3DinAction paper.☆15Mar 24, 2024Updated last year
- ☆29Oct 4, 2024Updated last year
- Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision"☆22Apr 21, 2024Updated last year
- [CVPR 2022] No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static Models by Fitting Feature-level Space-time Surfaces☆22May 30, 2024Updated last year
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting☆31Jun 5, 2025Updated 8 months ago
- ☆27Apr 3, 2024Updated last year
- [CVPR 2025] PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding☆96Jun 15, 2025Updated 8 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- Stereo-LiDAR Depth Estimation with Deformable Propagation and Learned Disparity-Depth Conversion (ICRA2024)☆33Jul 10, 2024Updated last year
- MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations☆36Oct 17, 2024Updated last year
- [ICCV2023] Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction☆39Dec 15, 2023Updated 2 years ago
- The codes of D-DPCC: Deep Dynamic Point Cloud Compression via 3D Motion Prediction.☆41Jul 3, 2025Updated 8 months ago
- REPS: Reconstruction-based Point Cloud Sampling☆12Mar 18, 2024Updated last year
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Official repo for PIWM: Enhancing Physical Consistency in Lightweight World Models☆21Nov 26, 2025Updated 3 months ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- ☆43Nov 1, 2024Updated last year
- ☆12Jun 5, 2019Updated 6 years ago
- Instituto de Telecomunicações Deep Learning-based Point Cloud Codec☆11Jun 18, 2024Updated last year
- ☆10Apr 7, 2025Updated 10 months ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- ☆10Oct 5, 2022Updated 3 years ago
- Scene Spatio-Temporal Graph Convolutional Network for Pedestrian Intention Estimation☆12Feb 2, 2022Updated 4 years ago
- Light Field Super-Resolution Network Using Joint Spatio-Angular and Epipolar Information☆10May 31, 2023Updated 2 years ago
- ☆15Nov 4, 2025Updated 3 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- Code for paper "Trajectory-CNN: a new spatio-temporal feature learning network for human motion prediction"☆14Dec 20, 2023Updated 2 years ago
- [ICCV 2025] RadarSplat: Radar Gaussian Splatting for High-Fidelity Data Synthesis and 3D Reconstruction of Autonomous Driving Scenes☆22Feb 10, 2026Updated 3 weeks ago
- Husky-LIO-SAM☆11Feb 23, 2023Updated 3 years ago
- ☆11Jan 18, 2025Updated last year
- Monorepo blueprint for developer platform☆11Dec 22, 2025Updated 2 months ago
- WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model☆40Aug 15, 2025Updated 6 months ago
- CoMA: Compositional Human Motion Generation with Multi-modal Agents☆14Jul 31, 2025Updated 7 months ago
- [IROS 2025] EgoLoc: Zero-Shot Temporal Interaction Localization for Egocentric Videos☆32Jan 13, 2026Updated last month