☆38Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for CLIPOCR
Users that are interested in CLIPOCR are comparing it to the libraries listed below
Sorting:
- [IJCAI-2024] The official code of Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition☆10Aug 10, 2025Updated 6 months ago
- [ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective☆201Nov 1, 2023Updated 2 years ago
- The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)☆27Sep 3, 2023Updated 2 years ago
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer☆78Apr 9, 2024Updated last year
- Turning a CLIP Model into a Scene Text Detector (CVPR2023) | Turning a CLIP Model into a Scene Text Spotter (TPAMI)☆201Jun 17, 2024Updated last year
- ☆27Feb 20, 2024Updated 2 years ago
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆32Jun 12, 2025Updated 8 months ago
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆146Nov 14, 2025Updated 3 months ago
- [MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆16Nov 3, 2023Updated 2 years ago
- ☆42Sep 2, 2023Updated 2 years ago
- Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition☆28Aug 29, 2023Updated 2 years ago
- Implementation of Bidirectional Scene Text Recognition with a Single Decoder☆65Nov 24, 2024Updated last year
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10May 15, 2024Updated last year
- It's the code for the paper Pushing the Performance Limit of Scene Text Recognizer without Human Annotation, CVPR 2022.☆28Jul 6, 2022Updated 3 years ago
- ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting☆86Feb 11, 2023Updated 3 years ago
- Pytorch re-implementation of Paper: SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition (…☆288Nov 29, 2024Updated last year
- [ICCV2023] Self-supervised Character-to-Character Distillation for Text Recognition☆151Apr 20, 2024Updated last year
- Official PyTorch implementation of `Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition`☆74Feb 27, 2023Updated 3 years ago
- Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features …☆73Jun 24, 2023Updated 2 years ago
- A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)☆105Dec 9, 2021Updated 4 years ago
- ☆14May 26, 2023Updated 2 years ago
- ☆188Feb 27, 2024Updated 2 years ago
- ☆78Aug 7, 2023Updated 2 years ago
- Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)☆126Nov 13, 2023Updated 2 years ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Sep 22, 2023Updated 2 years ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Oct 5, 2023Updated 2 years ago
- H. Zhang, Q. Yao, M. Yang, Y. Xu, X. Bai. AutoSTR: Efficient Backbone Search for Scene Text Recognition. European Conference on Computer …☆84Aug 6, 2020Updated 5 years ago
- A comprehensive list [Hi-SAM@TPAMI'24, GoMatching@NeurIPS'24, DeepSolo(++)@ CVPR'23, DPText-DETR@AAAI'23, I3CL@IJCV'22] of our research w…☆93Nov 12, 2024Updated last year
- convert paddleOCR to torchOCR, ppocr-v3,ppocr-v4, onnx, openvino☆33Aug 16, 2023Updated 2 years ago
- Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining☆353Nov 29, 2023Updated 2 years ago
- A network for irregular text recognition.☆26Dec 11, 2020Updated 5 years ago
- The official implementation of SPTS v2: Single-Point Text Spotting☆140Jun 29, 2023Updated 2 years ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Sep 12, 2023Updated 2 years ago
- Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition☆459Oct 14, 2022Updated 3 years ago
- Code for ICCV 2023 Paper : “ICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extraction”☆54Aug 8, 2023Updated 2 years ago
- ☆60Jul 25, 2023Updated 2 years ago
- A Dead Simple and Modularized Multi-Modal Training and Finetune Framework. Compatible to any LLaVA/Flamingo/QwenVL/MiniGemini etc series …☆19Apr 24, 2024Updated last year
- The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm…☆34Jun 21, 2022Updated 3 years ago
- Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)☆692May 29, 2024Updated last year