OliverRensu / TinyMIM
☆156Updated last year
Alternatives and similar repositories for TinyMIM
Users who are interested in TinyMIM are comparing it to the repositories listed below
- A benchmark dataset for GRES and GREC [CVPR2023 Highlight]☆234Updated last year
- [ECCV2022] Factorizing Knowledge in Neural Networks☆89Updated 2 years ago
- [NeurIPS2022] Deep Model Reassembly☆253Updated last year
- [CVPR2023 Highlight] GRES: Generalized Referring Expression Segmentation☆695Updated last year
- [ICCV2021 & TPAMI2023] Vision-Language Transformer and Query Generation for Referring Segmentation☆354Updated 3 years ago
- [CVPR2023 Highlight] Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object Detection☆305Updated last year
- [ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes☆355Updated last year
- [ICCV'23 Oral] The introduction and toolkit for EqBen Benchmark☆127Updated last year
- [ECCV 2022] Patch Similarity Aware Data-Free Quantization for Vision Transformers☆123Updated 2 years ago
- Official pytorch implementation of paper "Inception Convolution with Efficient Dilation Search" (CVPR 2021 Oral).☆112Updated 3 years ago
- [ICCV 2023] MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions☆522Updated 10 months ago
- [CVPR-2023] Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation☆189Updated last year
- HRViT ("Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation"), CVPR 2022.☆192Updated 2 years ago
- This is an official implementation of our NeurIPS 22 paper “QueryPose: Sparse Multi-Person Pose Regression via Spatial-Aware Part-Level Qu…☆49Updated 2 years ago
- PyTorch implementation of the paper "MILAN: Masked Image Pretraining on Language Assisted Representation" https://arxiv.org/pdf/2208.0604…☆82Updated 2 years ago
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision" (https://arxiv.o…☆39Updated 2 years ago
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆91Updated last year
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- [CVPR 2022 Oral] Towards Fewer Annotations: Active Learning via Region Impurity and Prediction Uncertainty for Domain Adaptive Semantic S…☆152Updated last year
- CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet☆215Updated 2 years ago
- Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"☆259Updated last year
- Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)☆335Updated last year
- Collection of awesome parameter-efficient fine-tuning resources.☆550Updated last month
- Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training☆137Updated 2 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆55Updated last year
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆68Updated 2 years ago
- [CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆106Updated last year
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption☆98Updated last year
- [CVPR 2023 (Highlight)] Official implementation of the paper "RepMode: Learning to Re-parameterize Diverse Experts for Subcellular Structu…☆169Updated last year
- [CVPR-2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation☆111Updated 10 months ago