megvii-research / MSCL
[ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation
☆17 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for MSCL
- Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022) ☆45 · Updated last year
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by … ☆70 · Updated 9 months ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection" ☆14 · Updated last year
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023) ☆21 · Updated 4 months ago
- [CVPR2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit… ☆18 · Updated 2 years ago
- Turning to Video for Transcript Sorting ☆46 · Updated last year
- Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021) ☆29 · Updated 3 years ago
- ☆17 · Updated 7 months ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023) ☆31 · Updated last year
- ☆31 · Updated 3 years ago
- i-mae Pytorch Repo ☆19 · Updated 7 months ago
- ☆52 · Updated last year
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training ☆21 · Updated 2 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022. ☆28 · Updated 2 years ago
- Benchmarking Attention Mechanism in Vision Transformers. ☆16 · Updated 2 years ago
- This repository contains the code for our CVPR 2022 paper on "Audio-visual Generalised Zero-shot Learning with Cross-modal Attention and … ☆34 · Updated last year
- Rethinking Nearest Neighbors for Visual Classification ☆31 · Updated 2 years ago
- ☆34 · Updated 2 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023) ☆36 · Updated last month
- [ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, M… ☆17 · Updated last month
- ☆16 · Updated last year
- ☆32 · Updated 6 months ago
- LAEO-Net++ ☆20 · Updated 3 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition ☆32 · Updated 3 years ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter". ☆16 · Updated last year
- Official Code of ECCV 2022 paper MS-CLIP ☆86 · Updated 2 years ago
- This repository contains the code for our ECCV 2022 paper "Temporal and cross-modal attention for audio-visual zero-shot learning" ☆24 · Updated last year
- ☆22 · Updated last year
- ☆47 · Updated 2 years ago
- Compress conventional Vision-Language Pre-training data ☆49 · Updated last year