Malitha123 / awesome-video-self-supervised-learning
A curated list of awesome self-supervised learning methods in videos
☆166 · Dec 5, 2025 · Updated 2 months ago
Alternatives and similar repositories for awesome-video-self-supervised-learning
Users interested in awesome-video-self-supervised-learning are comparing it to the repositories listed below:
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022] ☆41 · Feb 28, 2024 · Updated last year
- [NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts ☆13 · Jan 30, 2024 · Updated 2 years ago
- This is the official implementation of Global-local Motion Transformer for Unsupervised Skeleton-based Action Learning (ECCV 2022). ☆23 · Nov 6, 2023 · Updated 2 years ago
- ☆26 · Aug 31, 2023 · Updated 2 years ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders ☆84 · Feb 2, 2024 · Updated 2 years ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training ☆1,675 · Dec 8, 2023 · Updated 2 years ago
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning ☆24 · Dec 19, 2023 · Updated 2 years ago
- Learning Debiased and Disentangled Representations for Semantic Segmentation (NeurIPS 2021) ☆13 · Jan 23, 2022 · Updated 4 years ago
- official implementation of CVPR 23 paper "M3Video: Masked Motion Modeling for Self-Supervised Video Representation Learning" ☆52 · Dec 8, 2023 · Updated 2 years ago
- Awesome papers & datasets specifically focused on long-term videos. ☆352 · Oct 9, 2025 · Updated 4 months ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020. ☆49 · Mar 18, 2021 · Updated 4 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity ☆25 · Jul 11, 2023 · Updated 2 years ago
- This repository contains the implementation of our AAAI'23 oral paper Hierarchical Contrast for Unsupervised Skeleton-based Action R… ☆31 · Feb 15, 2023 · Updated 2 years ago
- ☆64 · Oct 27, 2023 · Updated 2 years ago
- [ECCV 2024] The official repo for "SA-DVAE: Improving Zero-Shot Skeleton-Based Action Recognition by Disentangled Variational Autoencoder… ☆37 · Jul 19, 2024 · Updated last year
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning ☆16 · Jun 1, 2025 · Updated 8 months ago
- 【CVPR'24】OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition ☆38 · Apr 27, 2024 · Updated last year
- [Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897) ☆353 · Apr 23, 2025 · Updated 9 months ago
- Video Contrastive Learning with Global Context, ICCVW 2021 ☆162 · May 30, 2022 · Updated 3 years ago
- Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners" ☆360 · Jan 12, 2026 · Updated last month
- Video datasets ☆1,606 · Mar 8, 2023 · Updated 2 years ago
- [CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking ☆750 · Oct 8, 2024 · Updated last year
- Official repository for "Self-Supervised Video Transformer" (CVPR'22) ☆108 · Jun 26, 2024 · Updated last year
- [CVPR 2024] - Official code for the paper "Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation" ☆47 · Aug 22, 2024 · Updated last year
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding" ☆27 · Jan 17, 2026 · Updated 3 weeks ago
- The official project website of "Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition" (The paper of Ske2Grid is pub… ☆19 · Sep 6, 2023 · Updated 2 years ago
- Code for "Class-Incremental Learning for Action Recognition in Videos", ICCV 2021 ☆21 · Oct 14, 2022 · Updated 3 years ago
- [PR 2024] TFS-ViT: Token-Level Feature Stylization for Domain Generalization ☆25 · Mar 29, 2023 · Updated 2 years ago
- [ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model ☆140 · Apr 9, 2024 · Updated last year
- 🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs. ☆3,066 · Dec 20, 2025 · Updated last month
- Foundation Models for Video Understanding: A Survey ☆142 · Jul 9, 2025 · Updated 7 months ago
- A collection of awesome video generation studies. ☆730 · Dec 27, 2025 · Updated last month
- ☆41 · May 7, 2022 · Updated 3 years ago
- [ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning. ☆49 · Dec 10, 2025 · Updated 2 months ago
- STPN - Weakly Supervised Action Localization by Sparse Temporal Pooling Network ☆82 · Dec 6, 2018 · Updated 7 years ago
- ☆24 · Oct 11, 2017 · Updated 8 years ago
- Self-Supervised Learning by Cross-Modal Audio-Video Clustering (NeurIPS 2020) ☆91 · Oct 24, 2022 · Updated 3 years ago
- ☆21 · Jul 3, 2025 · Updated 7 months ago
- [AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding ☆91 · Nov 16, 2022 · Updated 3 years ago