ContentAndMaterialPortrait / MAE-GEBDLinks

CVPR’2022 Kinetics-GEBD Challenge

☆10

Alternatives and similar repositories for MAE-GEBD

Users that are interested in MAE-GEBD are comparing it to the libraries listed below

Sorting:

showlab / DemoVLP
[Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training
☆21Updated 3 years ago
AmeenAli / VideoMatch
☆12Updated 3 years ago
showlab / Region_Learner
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
☆42Updated 3 years ago
17Skye17 / VideoLT
Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)
☆34Updated 3 years ago
hello-jinwoo / LOVEU-CVPR2021
☆27Updated 2 years ago
TencentARC / TVTS
Turning to Video for Transcript Sorting
☆48Updated last year
alibaba / Deep-Vision
☆36Updated 3 years ago
LeeYN-43 / Clover
Offical PyTorch implementation of Clover: Towards A Unified Video-Language Alignment and Fusion Model (CVPR2023)
☆40Updated 2 years ago
rvl-lab-utoronto / video_similarity_search
SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]
☆19Updated 2 years ago
vt-vl-lab / video-data-aug
Learning Representational Invariances for Data-Efficient Action Recognition
☆33Updated 3 years ago
alibaba-mmai-research / HiCo
CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
☆17Updated 2 years ago
NNNNAI / Ego4d_NLQ_2022_1st_Place_Solution
The 1st place solution of 2022 Ego4d Natural Language Queries.
☆32Updated 2 years ago
Mark12Ding / FAME
[CVPR 2022] Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging
☆48Updated last year
IBM / AdaMML
Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.
☆51Updated 3 years ago
sauradip / TAGS
[ECCV 2022] Official Pytorch Implementation of paper : " Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning "…
☆17Updated 2 years ago
zhjohnchan / SK-VG
[CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.
☆31Updated 2 years ago
FingerRec / OA-Transformer
[CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》
☆62Updated 3 years ago
MCG-NJU / ZeroI2V
[ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
☆22Updated 11 months ago
papermsucode / mdmmt
MDMMT: Multidomain Multimodal Transformer for Video Retrieval
☆26Updated 4 years ago
Roc-Ng / HANet
PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).
☆47Updated 3 years ago
ByZ0e / Glance-Focus
This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)
☆27Updated last year
Trunpm / TPT-for-VideoQA
☆19Updated 2 years ago
farewellthree / STAN
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
☆104Updated last year
mzhaoshuai / CenterCLIP
[SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval. Also, a text-video retrieval toolbox based on CLIP + fast p…
☆132Updated 3 years ago
bighuang624 / VoP
[CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval
☆38Updated 2 years ago
IBM / sifar-pytorch
super image for action recognition
☆56Updated 3 years ago
tzhhhh123 / HC-STVG
The HC-STVG Dataset
☆56Updated 2 years ago
MCG-NJU / DDM
[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
☆50Updated 2 years ago
sauradip / fewshotQAT
[BMVC 2021]: Official PyTorch implementation of : "Few Shot Temporal Action Localization using Query Adaptive Transformers"
☆20Updated 3 years ago
kennymckormick / TransRank
[CVPR2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit…
☆18Updated 2 years ago