taoyang1122 / adapt-image-modelsLinks
[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
☆294Updated 2 years ago
Alternatives and similar repositories for adapt-image-models
Users that are interested in adapt-image-models are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"☆375Updated 3 years ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆333Updated last year
- Official code for "Top-Down Visual Attention from Analysis by Synthesis" (CVPR 2023 highlight)☆168Updated 2 years ago
- Video Swin Transformer - PyTorch☆267Updated 3 years ago
- Code Release for MViTv2 on Image Recognition.☆444Updated 11 months ago
- ☆191Updated 3 years ago
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆304Updated 3 years ago
- Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]☆126Updated 2 years ago
- ☆555Updated 3 years ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆540Updated 2 years ago
- Reading list for research topics in Masked Image Modeling☆336Updated 11 months ago
- [ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, vi…☆241Updated 2 years ago
- ☆84Updated 2 years ago
- The codes for TCFormer in paper: Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer☆242Updated last year
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".☆297Updated last year
- ☆180Updated 3 years ago
- PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529☆164Updated 3 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆93Updated last year
- This is a PyTorch implementation of “Context AutoEncoder for Self-Supervised Representation Learning"☆198Updated 2 years ago
- A curated list of awesome self-supervised learning methods in videos☆157Updated last week
- ☆643Updated last year
- ☆215Updated 2 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆133Updated 2 years ago
- ☆264Updated 2 years ago
- ☆49Updated 3 years ago
- A comprehensive collection of awesome research and other items about video domain adaptation☆110Updated 10 months ago
- The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)☆295Updated last month
- Official Open Source code for "Scaling Language-Image Pre-training via Masking"☆427Updated 2 years ago
- Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23☆223Updated last year
- [ECCV 2022] LAFF for Text-to-Video Retrieval☆46Updated last year