A curated list of awesome self-supervised learning methods in videos
☆169Mar 10, 2026Updated last week
Alternatives and similar repositories for awesome-video-self-supervised-learning
Users that are interested in awesome-video-self-supervised-learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆41Feb 28, 2024Updated 2 years ago
- ☆26Aug 31, 2023Updated 2 years ago
- This is the official implementation of Global-local Motion Transformer for Unsupervised Skeleton-based Action Learning (ECCV 2022).☆23Nov 6, 2023Updated 2 years ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆84Feb 2, 2024Updated 2 years ago
- Official implementation of the ICCV 2023 paper "Masked Motion Predictors are Strong 3D Action Representation Learners"☆51Sep 22, 2023Updated 2 years ago
- This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …☆95May 17, 2024Updated last year
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,698Dec 8, 2023Updated 2 years ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆42Jun 10, 2025Updated 9 months ago
- This is a repository contains the implementation of our AAAI'23 oral paper Hierarchical Contrast for Unsupervised Skeleton-based Action R…☆31Feb 15, 2023Updated 3 years ago
- Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations ICCV23☆30Dec 30, 2024Updated last year
- Github repo for referring atomic video action recognition☆20Oct 2, 2024Updated last year
- Hierarchical Consistent Contrastive Learning for Skeleton-Based Action Recognition with Growing Augmentations, AAAI 2023☆29Dec 8, 2022Updated 3 years ago
- ☆28Oct 8, 2023Updated 2 years ago
- ☆20May 11, 2025Updated 10 months ago
- official implementation of CVPR 23 paper "M3Video: Masked Motion Modeling for Self-Supervised Video Representation Learning"☆52Dec 8, 2023Updated 2 years ago
- ☆64Oct 27, 2023Updated 2 years ago
- [Main Conference @ EACL'26] [Workshop @ NeurIPS'24] 🎞️ LVNet.☆42Feb 10, 2026Updated last month
- Learning Debiased and Disentangled Representations for Semantic Segmentation (NeurIPS 2021)☆13Jan 23, 2022Updated 4 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆25Jul 11, 2023Updated 2 years ago
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022)☆30Dec 1, 2022Updated 3 years ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆14Jun 1, 2025Updated 9 months ago
- [Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)☆353Apr 23, 2025Updated 11 months ago
- A curated list of awesome temporal action segmentation resources.☆246Apr 4, 2024Updated last year
- Pytorch implementation of Swin MAE https://arxiv.org/abs/2212.13805☆103Jul 7, 2025Updated 8 months ago
- Official code repository for SPAct: Self-supervised Privacy Preservation for Action Recognition [CVPR-2022]☆21Jun 5, 2022Updated 3 years ago
- Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"☆364Jan 12, 2026Updated 2 months ago
- Official repository for "Self-Supervised Video Transformer" (CVPR'22)☆108Jun 26, 2024Updated last year
- ☆32Sep 12, 2024Updated last year
- Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆37Apr 3, 2023Updated 2 years ago
- Learning An Effective Transformer for Remote Sensing Satellite Image Dehazing☆12Sep 25, 2023Updated 2 years ago
- ☆22Jul 3, 2025Updated 8 months ago
- Awesome papers & datasets specifically focused on long-term videos.☆360Oct 9, 2025Updated 5 months ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆49Mar 18, 2021Updated 5 years ago
- ☆23Nov 29, 2024Updated last year
- [ICCV 2025] Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning.☆53Mar 17, 2026Updated last week
- [PR 2024] TFS-ViT: Token-Level Feature Stylization for Domain Generalization☆25Mar 29, 2023Updated 2 years ago
- Video datasets☆1,619Mar 8, 2023Updated 3 years ago
- 🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.☆3,116Updated this week
- S2ME: Spatial-Spectral Mutual Teaching and Ensemble Learning for Scribble-supervised Polyp Segmentation (MICCAI 2023)☆20Dec 1, 2023Updated 2 years ago