[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding
☆26 · Oct 16, 2023 · Updated 2 years ago
Alternatives and similar repositories for MGMAE
Users that are interested in MGMAE are comparing it to the libraries listed below.
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets ☆38 · Aug 20, 2024 · Updated last year
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model ☆15 · Jul 31, 2025 · Updated 9 months ago
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video ☆20 · Jul 29, 2024 · Updated last year
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing ☆27 · Jul 15, 2022 · Updated 3 years ago
- ☆11 · Sep 4, 2024 · Updated last year
- [WACV 2025 Oral] Transferring Foundation Models for Generalizable Robotic Manipulation ☆27 · Mar 28, 2025 · Updated last year
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection ☆39 · Aug 29, 2023 · Updated 2 years ago
- Unofficial implementation of PointNet and PointNet++ ☆10 · Oct 26, 2023 · Updated 2 years ago
- [ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model ☆24 · Nov 1, 2024 · Updated last year
- [TIP] APP-Net: Auxiliary-point-based Push and Pull Operations for Efficient Point Cloud Recognition ☆13 · May 15, 2023 · Updated 2 years ago
- [ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition ☆41 · Nov 29, 2023 · Updated 2 years ago
- [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models ☆18 · Jan 11, 2026 · Updated 3 months ago
- [ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking ☆227 · Oct 15, 2025 · Updated 6 months ago
- [CVPR 2024] SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos ☆17 · May 21, 2024 · Updated last year
- A fast and efficient way to compute a differentiable bound on the singular values of convolution layers ☆12 · Nov 22, 2019 · Updated 6 years ago
- Implementation for paper: EM-driven unsupervised learning for efficient motion segmentation ☆18 · Feb 16, 2023 · Updated 3 years ago
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector ☆63 · May 18, 2023 · Updated 2 years ago
- Official code for the paper: MAR: Masked Autoencoders for Efficient Action Recognition ☆32 · Dec 7, 2022 · Updated 3 years ago
- CVPR2021: Detecting Human-Object Interaction via Fabricated Compositional Learning ☆16 · Jul 7, 2021 · Updated 4 years ago
- [TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding ☆29 · Sep 11, 2024 · Updated last year
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition" ☆54 · Dec 28, 2023 · Updated 2 years ago
- ☆22 · Jul 3, 2025 · Updated 10 months ago
- spatio-temporal tasks ☆16 · Jul 15, 2024 · Updated last year
- ☆26 · Aug 31, 2023 · Updated 2 years ago
- [CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su… ☆18 · Nov 4, 2025 · Updated 6 months ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer ☆345 · Apr 2, 2024 · Updated 2 years ago
- This is an official implement for "HSTforU: Anomaly Detection in Aerial and Ground-based Videos with Hierarchical Spatio-Temporal Transfo… ☆42 · Jan 30, 2025 · Updated last year
- This is the repo for an open project named detecting human object interactions in real-time ☆15 · Jun 30, 2018 · Updated 7 years ago
- [ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric ☆420 · Aug 15, 2024 · Updated last year
- PyTorch Tutorial to train ConvNets for Image Classification. ☆11 · May 20, 2021 · Updated 4 years ago
- [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos ☆11 · Jun 11, 2024 · Updated last year
- ☆35 · Dec 3, 2020 · Updated 5 years ago
- Appearance-Motion Memory Consistency Network for Video Anomaly Detection ☆36 · Oct 31, 2022 · Updated 3 years ago
- [NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and R… ☆38 · Sep 27, 2025 · Updated 7 months ago
- Reproducing and Improving CheXNet: Deep Learning for Chest X-ray Disease Classification ☆18 · Mar 24, 2026 · Updated last month
- [ICCV 2023] SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes ☆206 · Jul 24, 2023 · Updated 2 years ago
- Generate a denotation graph from a set of image captions ☆16 · Sep 4, 2018 · Updated 7 years ago
- Deep learning of an embedding mapping using t-SNE as a loss function on top of a 3-hidden-layer neural network. Use pytorch ! ☆20 · Jul 14, 2017 · Updated 8 years ago
- [ICDM 2022] Making Reconstruction-based Method Great Again for Video Anomaly Detection (PyTorch) ☆40 · Mar 25, 2024 · Updated 2 years ago