Malitha123 / awesome-video-self-supervised-learningLinks

A curated list of awesome self-supervised learning methods in videos

☆149

Alternatives and similar repositories for awesome-video-self-supervised-learning

Users that are interested in awesome-video-self-supervised-learning are comparing it to the libraries listed below

Sorting:

muzairkhattak / ViFi-CLIP
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
☆285Updated last year
OpenGVLab / video-mamba-suite
The suite of modeling video with Mamba
☆280Updated last year
ruiwang2021 / mvd
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…
☆133Updated 2 years ago
ispamm / GRAM
Official PyTorch repository for GRAM
☆85Updated 2 months ago
Visual-AI / FROSTER
The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"
☆85Updated 6 months ago
TalalWasim / Vita-CLIP
Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]
☆121Updated 2 years ago
NeeluMadan / ViFM_Survey
Foundation Models for Video Understanding: A Survey
☆129Updated 3 weeks ago
ttengwang / Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
☆284Updated 8 months ago
KHU-VLL / CAST
[NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"
☆53Updated last year
mingu6 / action_seg_ot
[CVPR 2024] - Official code for the paper "Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation"
☆40Updated 11 months ago
naver-ai / tc-clip
[ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"
☆69Updated 5 months ago
facebookresearch / mae_st
Official Open Source code for "Masked Autoencoders As Spatiotemporal Learners"
☆348Updated 8 months ago
benedettaliberatori / T3AL
Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024
☆64Updated 10 months ago
wgcban / adamae
[CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders
☆79Updated last year
bfshi / AbSViT
Official code for "Top-Down Visual Attention from Analysis by Synthesis" (CVPR 2023 highlight)
☆167Updated last year
antoyang / TubeDETR
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
☆183Updated last year
linziyi96 / st-adapter
☆81Updated 2 years ago
taoyang1122 / adapt-image-models
[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
☆291Updated last year
kyegomez / Vit-RGTS
Open source implementation of "Vision Transformers Need Registers"
☆184Updated 2 weeks ago
facebookresearch / EgoVLPv2
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
☆99Updated last year
OpenGVLab / efficient-video-recognition
☆176Updated 2 years ago
Finspire13 / DiffAct
Code for Diffusion Action Segmentation (ICCV 2023)
☆64Updated last year
fmu2 / snag_release
Official Implementation of SnAG (CVPR 2024)
☆51Updated 3 months ago
TalalWasim / Video-FocalNets
Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]
☆102Updated last year
OpenGVLab / unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
☆336Updated last year
dfan / webssl
Code for Scaling Language-Free Visual Representation Learning (WebSSL)
☆246Updated 3 months ago
MCG-NJU / AMD
[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models
☆18Updated last year
shashankvkt / DoRA_ICLR24
This repo contains the official implementation of ICLR 2024 paper "Is ImageNet worth 1 video? Learning strong image encoders from 1 long …
☆90Updated last year
xuyu0010 / awesome-video-domain-adaptation
A comprehensive collection of awesome research and other items about video domain adaptation
☆108Updated 6 months ago
all-things-vits / code-samples
Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and Interpreting Attention in Vision.
☆194Updated 2 years ago