ytpeng-aimlab / Multi-Stage-Partitioned-Transformer-for-Efficient-Image-Deraining
☆14Updated last year
Related projects
Alternatives and complementary repositories for Multi-Stage-Partitioned-Transformer-for-Efficient-Image-Deraining
- ☆13Updated 8 months ago
- The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:…☆248Updated 3 months ago
- ☆10Updated last year
- Domain-Generalized Face Anti-Spoofing with Unknown Attacks. ICIP, 2023☆25Updated last year
- An open-source implementation for fine-tuning Qwen2-VL series by Alibaba Cloud.☆117Updated 2 weeks ago
- [ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective☆171Updated last year
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)☆181Updated 5 months ago
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆237Updated last month
- ☆68Updated last year
- [TPAMI'24] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation☆211Updated last week
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆23Updated last year
- Soft Mixture of Experts Vision Transformer, addressing MoE limitations as highlighted by Puigcerver et al., 2023.☆12Updated last year
- BoT-SORT: Robust Associations Multi-Pedestrian Tracking☆158Updated 6 months ago
- ☆69Updated last year
- Official implementation for AnomalyCLIP (ICLR 2024)☆284Updated 3 months ago
- [AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformer☆174Updated last year
- [CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts☆297Updated 4 months ago
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆300Updated last month
- Fine-tuning Grounding DINO☆60Updated this week
- A curated list of papers, datasets and resources pertaining to open vocabulary object detection.☆285Updated 4 months ago
- This is the repository for paper UniQA☆12Updated 5 months ago
- [ICDAR 2024] (Best Student Paper🏆) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation☆12Updated 2 months ago
- Valeo Anomaly Dataset (VAD)☆19Updated 5 months ago
- This repo lists relevant papers summarized in our survey paper: A Systematic Survey of Prompt Engineering on Vision-Language Foundation …☆390Updated last month
- Document Artificial Intelligence☆131Updated last month
- Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆252Updated 2 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆50Updated 5 months ago
- ☆156Updated 8 months ago
- Downstream-Dino-V2: A GitHub repository featuring an easy-to-use implementation of the DINOv2 model by Facebook for downstream tasks such…☆195Updated last year
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆116Updated last month