WISION-Lab / eventful-transformerLinks

Code for our paper "Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers"

☆36

Alternatives and similar repositories for eventful-transformer

Users that are interested in eventful-transformer are comparing it to the libraries listed below

Sorting:

yuzhms / Streaming-Video-Model
[CVPR2023] Code for "Streaming Video Model"
☆79Updated 2 years ago
Ali2500 / BURST-benchmark
☆78Updated 2 years ago
janghyuncho / DECOLA
Code release for "Language-conditioned Detection Transformer"
☆88Updated last year
CVMI-Lab / CoDet
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
☆123Updated last year
showlab / sparseformer
(ICLR 2024, CVPR 2024) SparseFormer
☆75Updated last year
Nathan-Li123 / SMOTer
[ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking
☆53Updated last year
jozhang97 / DETA
Detection Transformers with Assignment
☆265Updated 2 years ago
KainingYing / CTVIS
[ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation
☆80Updated 2 years ago
V3Det / V3Det
☆120Updated last year
OpenGVLab / MUTR
「AAAI 2024」 Referred by Multi-Modality: A Unified Temporal Transformers for Video Object Segmentation
☆82Updated 7 months ago
SysCV / cascade-detr
[ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection
☆99Updated 2 years ago
HengLan / SMOT
[ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking
☆29Updated last year
facebookresearch / MeMViT
Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022
☆153Updated 3 years ago
bytedance / OmniScient-Model
This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model
☆99Updated last year
miranheo / GenVIS
[CVPR'23] A Generalized Framework for Video Instance Segmentation
☆136Updated 2 years ago
Surrey-UP-Lab / RegionSpot
Recognize Any Regions
☆123Updated last year
lxtGH / Tube-Link
[ICCV-2023]-Universal Video Segmentaion For VSS, VPS and VIS
☆110Updated last year
renwang435 / video-ttt-release
Test-Time Training on Video Streams
☆66Updated 2 years ago
FelixCaae / AlignDETR
[BMVC 2024] Official implementation of Align-DETR
☆61Updated last year
sukjunhwang / VITA
VITA: Video Instance Segmentation via Object Token Association (NeurIPS 2022)
☆105Updated 2 years ago
SysCV / ovtrack
OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]
☆112Updated last year
SwinTransformer / AiT
☆110Updated 2 years ago
wudongming97 / OnlineRefer
[ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
☆57Updated 2 years ago
impiga / Plain-DETR
[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design
☆226Updated 2 years ago
OpenGVLab / M3I-Pretraining
[CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.
☆91Updated 2 years ago
jiawen-zhu / TrackGPT
Tracking with Human-Intent Reasoning
☆74Updated last year
CASIA-LMC-Lab / Obj2Seq
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)
☆85Updated 3 years ago
guanxiongsun / vfe.pytorch
Video Feature Enhancement with PyTorch
☆32Updated last year
HengLan / VastTrack
[NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking
☆73Updated 4 months ago
Atten4Vis / GroupDETR
[ICCV 2023] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
☆43Updated 2 years ago