daniel-code/TubeViT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/daniel-code/TubeViT)

daniel-code / TubeViT

An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"

☆95

Alternatives and similar repositories for TubeViT

Users that are interested in TubeViT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ruiwang2021 / mvd
View on GitHub
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…
☆135May 21, 2023Updated 3 years ago
whwu95 / BIKE
View on GitHub
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
☆156Sep 9, 2024Updated last year
OpenGVLab / UniFormerV2
View on GitHub
[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
☆350Apr 2, 2024Updated 2 years ago
OpenGVLab / unmasked_teacher
View on GitHub
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
☆348May 27, 2024Updated 2 years ago
alibaba-mmai-research / TAdaConv
View on GitHub
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, vi…
☆246Aug 23, 2023Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
yueyang2000 / CKA_minibatch_pytorch
View on GitHub
Pytorch implementation of Centered Kernel Alignment(CKA) and its minibatch version.
☆11May 11, 2022Updated 4 years ago
HJYao00 / Side4Video
View on GitHub
☆42Apr 7, 2024Updated 2 years ago
YihengZhang-CV / MCL-Motion-Focused-Contrastive-Learning
View on GitHub
☆15Jan 11, 2022Updated 4 years ago
alibaba-mmai-research / DiST
View on GitHub
ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
☆41Sep 25, 2023Updated 2 years ago
CASIA-IVA-Lab / VRoPE
View on GitHub
[EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.
☆28Nov 18, 2025Updated 8 months ago
Hypnosx / Kinetics-TPS
View on GitHub
ICCV DeeperAction Challenge - Kinetics-TPS Challenge on Part-level Action Parsing and Action Recognition.
☆13Jun 4, 2021Updated 5 years ago
OpenGVLab / efficient-video-recognition
View on GitHub
☆184Aug 20, 2022Updated 3 years ago
KHU-VLL / CAST
View on GitHub
[NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"
☆55Dec 28, 2023Updated 2 years ago
99eren99 / DIS25k
View on GitHub
Official repository of "Deep Image Composition Meets Image Forgery"
☆13May 30, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
MCG-NJU / AMD
View on GitHub
[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models
☆18Jan 11, 2026Updated 6 months ago
facebookresearch / MeMViT
View on GitHub
Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022
☆155Nov 30, 2022Updated 3 years ago
TalalWasim / Vita-CLIP
View on GitHub
Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]
☆126Jul 1, 2023Updated 3 years ago
MCG-NJU / VideoMAE
View on GitHub
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,775Dec 8, 2023Updated 2 years ago
OpenGVLab / InternVideo
View on GitHub
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
☆2,339Jul 2, 2026Updated 2 weeks ago
XiaoBuL / OmniCLIP
View on GitHub
[ECAI-2024] OmniCLIP: Adapting CLIP for Video Recognition with Spatial-Temporal Omni-Scale Feature Learning
☆16Jan 7, 2025Updated last year
YangLiu9208 / TCGL
View on GitHub
[IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning
☆24Dec 19, 2023Updated 2 years ago
sming256 / AdaTAD
View on GitHub
[CVPR2024] The official implementation of AdaTAD: End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
☆42Jul 9, 2024Updated 2 years ago
sukjunhwang / set_classifier
View on GitHub
Official PyTorch implementation of: "Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in V…
☆14Aug 29, 2022Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
taoyang1122 / adapt-image-models
View on GitHub
[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
☆298Sep 17, 2023Updated 2 years ago
Necolizer / ISTA-Net
View on GitHub
[IROS 2023] Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition
☆21Jul 12, 2025Updated last year
jd730 / STRG
View on GitHub
Pytorch Implementation of Videos as Space-Time Region Graphs
☆27Updated this week
mx-mark / VideoTransformer-pytorch
View on GitHub
PyTorch implementation of a collections of scalable Video Transformer Benchmarks.
☆306May 4, 2022Updated 4 years ago
facebookresearch / TimeSformer
View on GitHub
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
☆1,863Apr 9, 2024Updated 2 years ago
alanaai / EVUD
View on GitHub
Egocentric Video Understanding Dataset (EVUD)
☆34Jul 4, 2024Updated 2 years ago
UCDvision / PatchSearch
View on GitHub
Code for the CVPR '23 paper, "Defending Against Patch-based Backdoor Attacks on Self-Supervised Learning"
☆10Jun 9, 2023Updated 3 years ago
LeapLabTHU / Uni-AdaFocus
View on GitHub
Official repository of Uni-AdaFocus (TPAMI 2024).
☆59Dec 17, 2024Updated last year
southnx / ACoLP
View on GitHub
Open Set Video HOI detection from Action-centric Chain-of-Look Prompting, ICCV2023
☆12Oct 3, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
nick11roberts / XD
View on GitHub
☆12Jul 6, 2022Updated 4 years ago
SCZwangxiao / video-ReTaKe
View on GitHub
Official implementation of paper ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding
☆40Mar 16, 2025Updated last year
Doronbh7 / segment-anything-2-fine-tune
View on GitHub
Segment-Anything-2 (SAM 2) fine tune with COCO data
☆15Aug 20, 2024Updated last year
Zaabon / spiking_yolo
View on GitHub
☆12Mar 24, 2021Updated 5 years ago
srmauvsoftware / srmauv
View on GitHub
SRM Autonomous Underwater Vehicle code
☆12Oct 30, 2020Updated 5 years ago
guoshengcv / CACL
View on GitHub
[CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning
☆24Jul 5, 2022Updated 4 years ago
kyusik-cho / CMOM
View on GitHub
Code for <Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing> in WACV 2023
☆12Jan 26, 2023Updated 3 years ago