Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”
☆12Mar 4, 2024Updated last year
Alternatives and similar repositories for VideoMAC
Users that are interested in VideoMAC are comparing it to the libraries listed below
Sorting:
- [ACM MM 2023] PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation☆12Aug 28, 2023Updated 2 years ago
- [ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation☆24Aug 26, 2023Updated 2 years ago
- RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark (ECCV 2024)☆31Sep 10, 2024Updated last year
- A Massive Multi-Discipline Lecture Understanding Benchmark☆32Nov 1, 2025Updated 3 months ago
- ☆37Jun 23, 2025Updated 8 months ago
- A repo for publishing solution to 3DCoMPaT++ challenge on an improved large-scale 3D vision dataset for compositional recognition☆14Jun 22, 2023Updated 2 years ago
- [IJCAI 2024] CoFInAl: Enhancing Action Quality Assessment with Coarse-to-Fine Instruction Alignment☆17Jul 16, 2024Updated last year
- [NeurIPS 2025 Spotlight] Generative Trajectory Stitching through Diffusion Composition☆68Sep 6, 2025Updated 5 months ago
- ReFLIP-VAD: Towards Weakly Supervised Video Anomaly Detection via Vision-Language Model☆14Nov 25, 2024Updated last year
- [ECCV 2024] STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment☆41Dec 27, 2023Updated 2 years ago
- This is the official implementation of "Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation"☆41Dec 1, 2024Updated last year
- ☆10Mar 31, 2025Updated 11 months ago
- Official code for "Weakly Supervised Two-Stage Training Scheme for Deep Video Fight Detection Model"☆12Oct 29, 2022Updated 3 years ago
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement☆17Jan 5, 2026Updated last month
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- The official repository of "MarineInst: A Foundation Model for Marine Image Analysis with Instance Visual Description". [ECCV Oral 2024.]☆17Sep 24, 2024Updated last year
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- ☆10Jun 5, 2024Updated last year
- ☆12Apr 19, 2024Updated last year
- ☆12Nov 18, 2023Updated 2 years ago
- An experiment with movie scenes and contrastive learning☆11Feb 1, 2025Updated last year
- ☆10May 29, 2024Updated last year
- ☆11Dec 13, 2023Updated 2 years ago
- 🌟 手把手教你在论文中插入代码链接☆24Aug 2, 2025Updated 6 months ago
- ICCV'23 | Adverse Weather Removal with Codebook Priors☆10Aug 28, 2023Updated 2 years ago
- Official Code for the NeurIPS'25 paper: Selective Learning for Deep Time Series Forecasting☆33Nov 7, 2025Updated 3 months ago
- ☆10Nov 13, 2025Updated 3 months ago
- classifier two-sample test for video anomaly detections☆11Jul 3, 2019Updated 6 years ago
- ☆10Feb 21, 2023Updated 3 years ago
- ☆12Mar 24, 2024Updated last year
- Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"☆14Oct 18, 2024Updated last year
- ☆11Dec 16, 2022Updated 3 years ago
- ☆11Jul 19, 2023Updated 2 years ago
- A repository of useful scripts for the course CS357 in the form of Jupyter Notebook.☆12Dec 11, 2021Updated 4 years ago
- ☆14Feb 10, 2025Updated last year
- DreamDance: Personalized Text-to-video Generation by Combining Text-to-Image Synthesis and Motion Transfer☆14Dec 16, 2022Updated 3 years ago
- Autopsy plugins meant to detect photo and video manipulations.☆13Sep 6, 2021Updated 4 years ago
- GDSC UOS Blog☆12Oct 31, 2023Updated 2 years ago