Reducing spatial redundancy in video recognition. SOTA computational efficiency.
☆127Dec 15, 2024Updated last year
Alternatives and similar repositories for AdaFocus
Users that are interested in AdaFocus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A general framework for inferring CNNs efficiently. Reduce the inference latency of MobileNet-V3 by 1.3x on an iPhone XS Max without sac…☆185Aug 27, 2023Updated 2 years ago
- [ECCV2020] Learn optimal resolution and skipping mechanism for efficient video understanding☆63Aug 17, 2020Updated 5 years ago
- ☆37Jul 8, 2021Updated 4 years ago
- Jittor implementation of Vision Transformer with Deformable Attention☆32Mar 1, 2022Updated 4 years ago
- Resolution adaptive network☆153Jul 17, 2022Updated 3 years ago
- ☆28Oct 6, 2022Updated 3 years ago
- [IEEE TPAMI] Latency-aware Unified Dynamic Networks for Efficient Image Recognition☆53Mar 20, 2025Updated last year
- Official repository of Uni-AdaFocus (TPAMI 2024).☆61Dec 17, 2024Updated last year
- Official implementation of A Mixture of Surprises for Unsupervised Reinforcement Learning☆23Nov 16, 2022Updated 3 years ago
- [NeurIPS 2022] Latency-aware Spatial-wise Dynamic Networks☆25Aug 21, 2023Updated 2 years ago
- Mutual Modality Learning code☆15Mar 1, 2021Updated 5 years ago
- Official implementation of Dynamic Perceiver☆43Nov 16, 2023Updated 2 years ago
- [ICLR2021] AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition☆34Apr 8, 2021Updated 4 years ago
- Accelerating T2t-ViT by 1.6-3.6x.☆259Nov 25, 2021Updated 4 years ago
- [CVPR2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit…☆18Aug 1, 2022Updated 3 years ago
- [IEEE TIP] Fine-grained Recognition with Learnable Semantic Data Augmentation☆31Dec 23, 2023Updated 2 years ago
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.☆101Feb 26, 2023Updated 3 years ago
- [ICCV 2021] MGSampler: An Explainable Sampling Strategy for Video Action Recognition☆52Jul 9, 2022Updated 3 years ago
- ☆22Oct 27, 2021Updated 4 years ago
- Shapley values for assessing the importance of each frame in a video☆17Mar 1, 2021Updated 5 years ago
- [CVPR 2021] CondenseNet V2: Sparse Feature Reactivation for Deep Networks☆86Aug 27, 2022Updated 3 years ago
- ☆11Nov 5, 2024Updated last year
- Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 20…☆26Oct 15, 2021Updated 4 years ago
- ☆11Sep 15, 2017Updated 8 years ago
- Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better perfo…☆90Oct 12, 2022Updated 3 years ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆92Mar 16, 2023Updated 3 years ago
- unsupervised clustering, generative model, mixed membership stochastic block model, kmeans, spectral clustering, point cloud data☆13Mar 16, 2020Updated 6 years ago
- ☆19Mar 5, 2025Updated last year
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding☆153Jul 13, 2024Updated last year
- ☆16Jun 19, 2022Updated 3 years ago
- code of SOE-Net released in ICCV 2017☆15May 26, 2020Updated 5 years ago
- [ICLR2021] official implementation of CT-Net☆37Dec 29, 2021Updated 4 years ago
- [Pattern Recognition 2025] Cross-Modal Adapter for Vision-Language Retrieval☆140Aug 17, 2025Updated 7 months ago
- train cifar10 example with mixup method☆10Dec 30, 2017Updated 8 years ago
- This repository contains the annotations used for evaluating Unsupervised Domain Adaptation on EPIC Kitchens, with individual kitchens us…☆13Jun 2, 2020Updated 5 years ago
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…☆226Aug 23, 2024Updated last year
- Python script for downloading Kinetics datasets (Kinetics400, Kinetics600, Kinetics700)☆18Jun 8, 2020Updated 5 years ago
- Displaced Aggregation Units for Convolutional Networks from "Spatially-Adaptive Filter Units for Deep Neural Networks" paper☆21Jun 27, 2024Updated last year
- This is an official implementation for "Video Swin Transformers".☆1,638Mar 8, 2023Updated 3 years ago