[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding
☆26Oct 16, 2023Updated 2 years ago
Alternatives and similar repositories for MGMAE
Users that are interested in MGMAE are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆20Jul 29, 2024Updated last year
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆23Jul 29, 2024Updated last year
- [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling☆21Jul 7, 2025Updated 8 months ago
- [ICCV 2023] Deep Equilibrium Object Detection☆27Jun 18, 2025Updated 8 months ago
- ☆11Sep 4, 2024Updated last year
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 7 months ago
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing☆27Jul 15, 2022Updated 3 years ago
- Unofficial implementation of PointNet and PointNet++☆10Oct 26, 2023Updated 2 years ago
- Code for "Unsupervised Space-Time Network for Temporally-Consistent Segmentation of Multiple Motions." (CVPR 2023)☆11Jun 15, 2023Updated 2 years ago
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆34Dec 23, 2024Updated last year
- (ICLR 2024, CVPR 2024) SparseFormer☆75Nov 10, 2024Updated last year
- Implementation for paper : EM-driven unsupervised learning for efficient motion segmentation☆17Feb 16, 2023Updated 3 years ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆38Aug 29, 2023Updated 2 years ago
- [WACV 2025 Oral] Transferring Foundation Models for Generalizable Robotic Manipulation☆26Mar 28, 2025Updated 11 months ago
- Track healthy organs in medical scans to improve cancer treatment☆13Jun 23, 2022Updated 3 years ago
- [ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking☆216Oct 15, 2025Updated 4 months ago
- [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models☆18Jan 11, 2026Updated last month
- spatio-temporal tasks☆16Jul 15, 2024Updated last year
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆63May 18, 2023Updated 2 years ago
- [TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding☆29Sep 11, 2024Updated last year
- ☆35Dec 3, 2020Updated 5 years ago
- A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video☆34Apr 12, 2022Updated 3 years ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆341Apr 2, 2024Updated last year
- ☆53Mar 17, 2025Updated 11 months ago
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging☆47Apr 27, 2025Updated 10 months ago
- ☆11Sep 2, 2024Updated last year
- [ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric☆401Aug 15, 2024Updated last year
- [ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection☆99Sep 12, 2023Updated 2 years ago
- This is anonymous repository for submitting our work to a conference☆14Dec 17, 2024Updated last year
- Vehicle to Vehicle Communication in Self-Driving Car☆12May 14, 2018Updated 7 years ago
- Simple Tensorflow implementation of "MirrorGAN: Learning Text-to-image Generation by Redescription" (CVPR 2019)☆15Mar 23, 2020Updated 5 years ago
- [CVPR 2024 Highlight] Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering☆10Jul 29, 2024Updated last year
- A synthetic training data generator for a text recognition CNN☆10Jul 8, 2019Updated 6 years ago
- ☆17Feb 8, 2026Updated last month
- Official repository of "TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection"☆11May 25, 2025Updated 9 months ago
- Official PyTorch implementation of: "Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in V…☆14Aug 29, 2022Updated 3 years ago
- ☆10Feb 23, 2021Updated 5 years ago
- Qwen-SAM is a reasoning-based segmentation model that integrates Qwen 2.5 VL 7B with the Segment Anything Model (SAM), enabling fine-grai…☆25Jun 4, 2025Updated 9 months ago
- A simple Computer Vision Framework, mainly based on PyTorch. Including distributed training, logging and so on.☆12Dec 2, 2023Updated 2 years ago