[ICME 2022] Code for the paper "SimViT: Exploring a simple vision transformer with sliding windows".
☆67Oct 11, 2022Updated 3 years ago
Alternatives and similar repositories for SimViT
Users who are interested in SimViT are comparing it to the libraries listed below.
- This is an official PyTorch/GPU implementation of SupMAE.☆79Aug 30, 2022Updated 3 years ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆92Dec 27, 2022Updated 3 years ago
- The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer☆116Aug 30, 2023Updated 2 years ago
- Official Codes and Pretrained Models for Dynamic MLP, CVPR2022, https://arxiv.org/abs/2203.03253☆87Mar 8, 2022Updated 4 years ago
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- This repo holds the research projects of our lab.☆11Jan 20, 2024Updated 2 years ago
- Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)☆137May 24, 2023Updated 2 years ago
- ☆70Mar 10, 2025Updated last year
- [NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"☆377Sep 16, 2022Updated 3 years ago
- ☆21Jul 27, 2019Updated 6 years ago
- This is the official repo of Panoptic SegFormer [CVPR'22]☆240Mar 3, 2022Updated 4 years ago
- Reading list for research topics in Masked Image Modeling☆335Dec 3, 2024Updated last year
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- Code for "Long-tail Detection with Effective Class-Margins." (ECCV 2022 Oral)☆62Sep 2, 2023Updated 2 years ago
- ☆59Jun 17, 2022Updated 3 years ago
- This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"☆197Jan 11, 2023Updated 3 years ago
- Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)☆198Aug 24, 2022Updated 3 years ago
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639)☆35Dec 23, 2022Updated 3 years ago
- This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".☆292Sep 28, 2022Updated 3 years ago
- BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training☆400Oct 23, 2024Updated last year
- [ECCV2022] This is an official implementation of paper "RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentati…☆78Feb 12, 2023Updated 3 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆29Jan 23, 2024Updated 2 years ago
- ConvMAE: Masked Convolution Meets Masked Autoencoders☆523Mar 14, 2023Updated 3 years ago
- ☆13Nov 7, 2021Updated 4 years ago
- This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".☆1,029Sep 29, 2022Updated 3 years ago
- Official implementation for paper "Relational Surrogate Loss Learning", ICLR 2022☆37Nov 25, 2022Updated 3 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval”☆12Jun 3, 2024Updated last year
- Summaries of machine learning papers☆12Aug 19, 2022Updated 3 years ago
- PyTorch implementation of R-MAE https://arxiv.org/abs/2306.05411☆113Jun 9, 2023Updated 2 years ago
- [ICCV 2021] A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation☆56Oct 8, 2022Updated 3 years ago
- [NeurIPS'22] Projector Ensemble Feature Distillation☆30Jan 4, 2024Updated 2 years ago
- Official PyTorch implementation of Fully Attentional Networks☆481Mar 31, 2023Updated 2 years ago
- ☆64Jan 22, 2022Updated 4 years ago
- Official Code of ECCV 2022 paper MS-CLIP☆91Jul 27, 2022Updated 3 years ago
- This repository contains the source code for SoftCTC. The original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- Generate multiple choice fill-in-the-blank questions from any article.☆13Dec 8, 2022Updated 3 years ago
- Two simple and effective vision transformer designs, which are on par with the Swin transformer☆608Feb 14, 2023Updated 3 years ago
- Joint learning of saliency detection and weakly supervised semantic segmentation☆25Sep 18, 2020Updated 5 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Nov 29, 2021Updated 4 years ago