cankocagil / SwinDetrLinks
Integration of Swin Transformer to DETR for Robust Object Detection (DEMO)
☆30Updated 4 years ago
Alternatives and similar repositories for SwinDetr
Users that are interested in SwinDetr are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of Sparse DETR☆175Updated 2 years ago
- ☆37Updated 3 years ago
- Sequencer: Deep LSTM for Image Classification☆142Updated 3 years ago
- ☆266Updated 3 years ago
- MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning☆146Updated 2 years ago
- ☆55Updated last year
- [ICME 2022] code for the paper, SimVit: Exploring a simple vision transformer with sliding windows.☆68Updated 3 years ago
- [CVPR 2023]Implementation of Siamese Image Modeling for Self-Supervised Vision Representation Learning☆41Updated last year
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o…☆39Updated 3 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆66Updated 9 months ago
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining☆107Updated 9 months ago
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".☆56Updated 3 years ago
- Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)☆104Updated last year
- ☆63Updated 2 years ago
- PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411☆113Updated 2 years ago
- Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022)☆166Updated 3 years ago
- [CVPR 2022] This repository includes the official project for the paper: TransMix: Attend to Mix for Vision Transformers.☆158Updated 3 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆57Updated 2 years ago
- Source code for CVPR 2022 paper Sylph A Hypernetwork Framework for Few-shot Object Detection☆72Updated 3 years ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆106Updated 2 years ago
- Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"☆245Updated 3 years ago
- [ICLR 2023] This repository includes the official implementation our paper "Can CNNs Be More Robust Than Transformers?"☆144Updated 3 years ago
- Leaderboard, taxonomy, and curated list of few-shot object detection papers.☆112Updated 4 years ago
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆91Updated 2 years ago
- ☆205Updated last year
- [CVPR-2022] Official implementation for "Knowledge Distillation with the Reused Teacher Classifier".☆102Updated 3 years ago
- This repo contains the code and configuration files for reproducing object detection results of FocalNets with DINO☆68Updated 2 years ago
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆108Updated 2 years ago
- A Siamese self-supervised pretraining approach for the Transformer architecture in DETR☆37Updated 2 years ago
- PyTorch implementation of Semi-supervised Vision Transformers☆61Updated 3 years ago