Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.
☆52Jul 13, 2022Updated 3 years ago
Alternatives and similar repositories for STTS
Users who are interested in STTS are comparing it to the repositories listed below
- [CVPR 2022] Official repository of AdaFocusV2.☆91Dec 15, 2024Updated last year
- ☆12Dec 15, 2023Updated 2 years ago
- [ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆41Nov 29, 2023Updated 2 years ago
- Official PyTorch implementation of ResFormer: Scaling ViTs with Multi-Resolution Training, CVPR2023☆30Jun 22, 2023Updated 2 years ago
- This repository contains the official implementation of CoMix (NeurIPS 2021) https://arxiv.org/pdf/2110.15128.pdf.☆22Jan 12, 2022Updated 4 years ago
- Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)☆19Mar 9, 2024Updated 2 years ago
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆84Jul 1, 2024Updated last year
- ☆21Jan 17, 2025Updated last year
- ☆36Nov 4, 2022Updated 3 years ago
- Code for Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization.☆10Sep 28, 2021Updated 4 years ago
- ☆26Dec 26, 2024Updated last year
- This repository is the official PyTorch implementation of Balanced Product of Calibrated Experts for Long-Tailed Recognition (CVPR 2023).☆18Mar 13, 2025Updated last year
- PyDeepFakeDet is an integrated and scalable tool for Deepfake detection.☆112Nov 6, 2022Updated 3 years ago
- AFNet(NeurIPS 2022)☆20Nov 24, 2022Updated 3 years ago
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Jul 29, 2022Updated 3 years ago
- [CVPR2022] SVIP: Sequence VerIfication for Procedures in Videos☆24Feb 24, 2023Updated 3 years ago
- Accelerating T2T-ViT by 1.6-3.6x.☆259Nov 25, 2021Updated 4 years ago
- [CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection☆86Feb 19, 2023Updated 3 years ago
- ☆120May 12, 2022Updated 3 years ago
- Code for our paper "Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers"☆36Jan 27, 2026Updated last month
- ☆21Mar 1, 2023Updated 3 years ago
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning☆24Dec 19, 2023Updated 2 years ago
- [CVPR 2022] End-to-End Semi-Supervised Learning for Video Action Detection☆35May 3, 2023Updated 2 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Apr 3, 2023Updated 2 years ago
- Deep Multi-layer Fusion Dense Network for Hyperspectral Image Classification.☆11Apr 25, 2021Updated 4 years ago
- The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)☆16Nov 12, 2024Updated last year
- [ECCV 2022] A PyTorch implementation of TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval☆79Nov 29, 2022Updated 3 years ago
- ☆182Aug 20, 2022Updated 3 years ago
- ☆16May 12, 2025Updated 10 months ago
- Implementation of ViViT: A Video Vision Transformer☆557Jun 21, 2021Updated 4 years ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆341Apr 2, 2024Updated last year
- ☆11Sep 4, 2024Updated last year
- PyTorch implementation of a collection of scalable Video Transformer Benchmarks.☆306May 4, 2022Updated 3 years ago
- Circuit Synthesis for Yao's Garbled Circuit by TinyGarble☆11Sep 25, 2020Updated 5 years ago
- ☆11Mar 31, 2023Updated 2 years ago
- ☆17Feb 1, 2023Updated 3 years ago
- Extracting optical flow and frames☆318Mar 1, 2022Updated 4 years ago
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆20Jul 29, 2025Updated 7 months ago
- Project page for the 'CLAWS: Clustering Assisted Weakly Supervised Learning with Normalcy Suppression for Anomalous Event Detection', ECC…☆12May 29, 2021Updated 4 years ago