Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.
☆52Jul 13, 2022Updated 3 years ago
Alternatives and similar repositories for STTS
Users that are interested in STTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Dec 15, 2023Updated 2 years ago
- [ICCV2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆41Nov 29, 2023Updated 2 years ago
- This repository contains the official implementation of CoMix (NeurIPS 2021) https://arxiv.org/pdf/2110.15128.pdf.☆22Jan 12, 2022Updated 4 years ago
- Official PyTorch implementation of ResFormer: Scaling ViTs with Multi-Resolution Training, CVPR2023☆30Jun 22, 2023Updated 2 years ago
- Contrastive Video Question Answering via Video Graph Transformer (IEEE T-PAMI'23)☆19Mar 9, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)☆87Jul 1, 2024Updated last year
- ☆21Jan 17, 2025Updated last year
- ☆36Nov 4, 2022Updated 3 years ago
- ☆12Jul 30, 2019Updated 6 years ago
- Code for Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization.☆10Sep 28, 2021Updated 4 years ago
- ☆26Dec 26, 2024Updated last year
- This repository is the official Pytorch implementation of Balanced Product of Calibrated Experts for Long-Tailed Recognition (CVPR 2023).☆18Mar 13, 2025Updated last year
- ☆58Dec 2, 2025Updated 5 months ago
- PyDeepFakeDet is an integrated and scalable tool for Deepfake detection.☆114Nov 6, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- AFNet(NeurIPS 2022)☆20Nov 24, 2022Updated 3 years ago
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Jul 29, 2022Updated 3 years ago
- Accelerating T2t-ViT by 1.6-3.6x.☆260Nov 25, 2021Updated 4 years ago
- [CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection☆87Feb 19, 2023Updated 3 years ago
- ☆120May 12, 2022Updated 3 years ago
- Code for our paper "Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers"☆38Jan 27, 2026Updated 3 months ago
- Risky Object Localization (ROL) in a Driving Scene Dataset☆15Dec 24, 2023Updated 2 years ago
- Hyperspectral Imagery One Class Classification (ISPRS 2022 & TGRS 2023)☆13Jan 28, 2026Updated 3 months ago
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning☆24Dec 19, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2022] End-to-End Semi-Supervised Learning for Video Action Detection☆35May 3, 2023Updated 2 years ago
- ☆33Jul 28, 2022Updated 3 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆37Apr 3, 2023Updated 3 years ago
- Deep Multi-layer Fusion Dense Network for Hyperspectral Image Classification.☆11Apr 25, 2021Updated 5 years ago
- The official GitHub repository for AC-EVAL, an ancient Chinese evaluation suite for large language models (LLMs)☆16Nov 12, 2024Updated last year
- [ECCV 2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval☆79Nov 29, 2022Updated 3 years ago
- ☆184Aug 20, 2022Updated 3 years ago
- Implementation of ViViT: A Video Vision Transformer☆558Jun 21, 2021Updated 4 years ago
- [ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer☆344Apr 2, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Sep 4, 2024Updated last year
- PyTorch implementation of a collections of scalable Video Transformer Benchmarks.☆308May 4, 2022Updated 3 years ago
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".☆55Oct 21, 2025Updated 6 months ago
- ☆37Jul 8, 2021Updated 4 years ago
- Extracting optical flow and frames☆319Mar 1, 2022Updated 4 years ago
- ☆17Feb 1, 2023Updated 3 years ago
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆21Jul 29, 2025Updated 9 months ago