ayanban011 / GraphKDLinks
[ICDAR 2024] (Best Student Paperπ) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation
β14Updated 11 months ago
Alternatives and similar repositories for GraphKD
Users that are interested in GraphKD are comparing it to the libraries listed below
Sorting:
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)β193Updated last year
- [AAAI'23 Oral] DPText-DETR: Towards Better Scene Text Detection with Dynamic Points in Transformerβ191Updated last year
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regressionβ66Updated 5 months ago
- [IJCAI2023] Your text images can be more clearer!β58Updated last year
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.β66Updated last year
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformerβ77Updated last year
- [ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspectiveβ189Updated last year
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matchingβ26Updated 2 months ago
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20β¦β55Updated last year
- A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research wβ¦β87Updated 8 months ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023β26Updated 2 years ago
- The official repo for [CVPR'23] "DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting" & [ArXiv'23] "DeepSolo++:β¦β274Updated 2 months ago
- β38Updated last year
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizerβ53Updated last year
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"β26Updated last year
- Official implementation of SPTS: Single-Point Text Spotting (ACM MM 2022 Oral)β143Updated 2 years ago
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Modelsβ36Updated 4 months ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)β83Updated last year
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformerβ104Updated last year
- Official PyTorch implementation of "CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection"β21Updated last year
- A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution (CVPR2022)β181Updated 3 years ago
- β26Updated last year
- [MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"β15Updated last year
- The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normβ¦β34Updated 3 years ago
- The official code of CornerTransformer (ECCV 2022, Oral) on top of MMOCR.β145Updated 2 years ago
- (CVPR 2022) Text Spotting Transformersβ187Updated 2 years ago
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.β28Updated 3 years ago
- Real-CE: A Benchmark for Chinese-English Scene Text Image Super-resolution (ICCV2023)β88Updated last year
- [PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Caβ¦β62Updated 3 weeks ago
- Official code for DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degraβ¦β34Updated 2 months ago