ayanban011 / GraphKD
[ICDAR 2024] (Best Student Paperπ) Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation
β13Updated 7 months ago
Alternatives and similar repositories for GraphKD:
Users that are interested in GraphKD are comparing it to the libraries listed below
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023β23Updated last year
- The official repo for the technical report "Scalable Mask Annotation for Video Text Spotting"β17Updated last year
- Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layoutβ18Updated 2 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.β61Updated 10 months ago
- [IJCAI2023] Your text images can be more clearer!β57Updated last year
- [MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"β16Updated last year
- β37Updated last year
- β24Updated last year
- β14Updated 9 months ago
- [IJCV 2024] TransDETR: End-to-end Video Text Spotting with Transformerβ103Updated last year
- RoDLA: Benchmarking the Robustness of Document Layout Analysis Modelsβ34Updated last month
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentationβ71Updated 7 months ago
- Official PyTorch implementation of "CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection"β19Updated last year
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matchingβ22Updated 4 months ago
- β26Updated last year
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformerβ76Updated last year
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizerβ53Updated 10 months ago
- β15Updated last year
- Update the latest text-related papers from top conferencesβ24Updated last month
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.β28Updated 2 years ago
- The official implementation of our ECCV 2024 publication, PYRA (Parallel Yielding Re-Activation).β16Updated 7 months ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)β82Updated last year
- ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spottingβ36Updated 2 weeks ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`β17Updated last year
- Discovering De-similarities of Modular Structure Between Tumor Cells and Normal Cells by Integrating Multiple Data Sources Through Joint β¦β9Updated 7 months ago
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)β193Updated 10 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".β38Updated 3 months ago
- Evaluating GPT-4o's image generation and editing ability in OCR tasks.β41Updated 3 weeks ago
- [AAAI 2024] SRFormer: Text Detection Transformer with Incorporated Segmentation and Regressionβ64Updated 2 months ago
- Official implementation of ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining (AAAI 20β¦β51Updated 9 months ago