[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding
☆26Oct 16, 2023Updated 2 years ago
Alternatives and similar repositories for MGMAE
Users that are interested in MGMAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆23Jul 29, 2024Updated last year
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆38Aug 20, 2024Updated last year
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 7 months ago
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆20Jul 29, 2024Updated last year
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing☆27Jul 15, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [CVPR 2024] Sparse Global Matching for Video Frame Interpolation with Large Motion☆78Jul 4, 2024Updated last year
- [ICCV 2023] Deep Equilibrium Object Detection☆27Jun 18, 2025Updated 9 months ago
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆34Dec 23, 2024Updated last year
- ☆11Sep 4, 2024Updated last year
- [WACV 2025 Oral] Transferring Foundation Models for Generalizable Robotic Manipulation☆27Mar 28, 2025Updated last year
- Code for "Unsupervised Space-Time Network for Temporally-Consistent Segmentation of Multiple Motions." (CVPR 2023)☆11Jun 15, 2023Updated 2 years ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆39Aug 29, 2023Updated 2 years ago
- Unofficial implementation of PointNet and PointNet++☆10Oct 26, 2023Updated 2 years ago
- (ICLR 2024, CVPR 2024) SparseFormer☆76Nov 10, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code and model for "Multi-dataset Training of Transformers for Robust Action Recognition", NeurIPS 2022 Spotlight☆20Aug 1, 2023Updated 2 years ago
- [TIP] APP-Net: Auxiliary-point-based Push and Pull Operations for Efficient Point Cloud Recognition☆13May 15, 2023Updated 2 years ago
- [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models☆18Jan 11, 2026Updated 2 months ago
- [ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking☆220Oct 15, 2025Updated 5 months ago
- [CVPR 2023 Hightlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos☆33Aug 30, 2023Updated 2 years ago
- [CVPR 2024] SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos☆17May 21, 2024Updated last year
- Track healthy organs in medical scans to improve cancer treatment☆13Jun 23, 2022Updated 3 years ago
- Implementation for paper : EM-driven unsupervised learning for efficient motion segmentation☆17Feb 16, 2023Updated 3 years ago
- [CVPR 2021] CGA-Net: Category Guided Aggregation for Point Cloud Semantic Segmentation☆24Jan 30, 2022Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging☆47Apr 27, 2025Updated 11 months ago
- CVPR2021: Detecting Human-Object Interaction via Fabricated Compositional Learning☆16Jul 7, 2021Updated 4 years ago
- [TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding☆29Sep 11, 2024Updated last year
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"☆54Dec 28, 2023Updated 2 years ago
- spatio-temporal tasks☆16Jul 15, 2024Updated last year
- A simple Computer Vision Framework, mainly based on PyTorch. Including distributed training, logging and so on.☆12Dec 2, 2023Updated 2 years ago
- ☆26Aug 31, 2023Updated 2 years ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆341Apr 2, 2024Updated last year
- A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video☆34Apr 12, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- This is an official implement for "HSTforU: Anomaly Detection in Aerial and Ground-based Videos with Hierarchical Spatio-Temporal Transfo…☆40Jan 30, 2025Updated last year
- This is the repo for an open project named detecting human object interactions in real-time☆15Jun 30, 2018Updated 7 years ago
- [ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric☆405Aug 15, 2024Updated last year
- PyTorch Tutorial to train ConvNets for Image Classification.☆11May 20, 2021Updated 4 years ago
- [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos☆12Jun 11, 2024Updated last year
- ☆35Dec 3, 2020Updated 5 years ago
- [CVPR 2024 Highlight] Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering☆10Jul 29, 2024Updated last year