Optocal Character Recognition (OCR / HTR) using Transformers
☆11Aug 20, 2022Updated 3 years ago
Alternatives and similar repositories for OCR-TR
Users that are interested in OCR-TR are comparing it to the libraries listed below
Sorting:
- Hadwritten Text Recognition in Few-shot Scenario☆22Mar 25, 2023Updated 2 years ago
- Let there be clock in the beach - WACV 2022☆15Nov 15, 2021Updated 4 years ago
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆28Jul 12, 2023Updated 2 years ago
- WACV 2022 Paper - Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching☆16Dec 10, 2021Updated 4 years ago
- [WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆22Oct 11, 2025Updated 4 months ago
- A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers☆59Sep 9, 2024Updated last year
- ICDAR 2019☆25Aug 2, 2019Updated 6 years ago
- Based on the WACV 2020 paper - Fine Grained Classification and Retrieval by Combining Visual and Locally Pooled Textual Features☆25Nov 15, 2021Updated 4 years ago
- [ECCV'24] NamedCurves: Learned Image Enhancement via Color Naming☆33Sep 8, 2025Updated 5 months ago
- The official repo of the Comics Survey: "A missing piece in Vision and Language: A Survey on Comics Understanding"☆133Jan 2, 2025Updated last year
- 青岛船舶检测☆13Apr 16, 2025Updated 10 months ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆16Nov 20, 2024Updated last year
- Official code of our ICCV paper "A Fast Unified System for 3D Object Detection and Tracking"☆10Sep 29, 2023Updated 2 years ago
- 【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending☆14Jun 16, 2025Updated 8 months ago
- Decompose complex polygons into sets of triangles☆10Oct 4, 2020Updated 5 years ago
- Tool to train/test models on 3d point cloud segmentation☆10Jun 14, 2025Updated 8 months ago
- Graph Metric Learning in PyTorch☆10Apr 7, 2021Updated 4 years ago
- Code for CLVision workshop (CVPR 2024) paper - Calibrating Higher-Order Statistics for Few-Shot Class-Incremental Learning with Pre-train…☆11Nov 12, 2024Updated last year
- Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals☆12May 24, 2024Updated last year
- [CVPR 2024 Highlight] - Stationary Representations: Optimally Approximating Compatibility and Implications for Improved Model Replacement…☆13Oct 21, 2024Updated last year
- OCR Annotations from Amazon Textract for Industry Documents Library☆103Aug 20, 2022Updated 3 years ago
- Accompanying code for "Analyzing Vision Tranformers in Class Embedding Space" (NeurIPS '23)☆15Jun 10, 2024Updated last year
- ☆10Jan 22, 2023Updated 3 years ago
- ☆10Oct 25, 2019Updated 6 years ago
- A library for novices who want to experiment with Machine Learning☆12May 19, 2024Updated last year
- STVQA and TextVQA OCR results from Amazon Text in Image pipeline☆12Jul 18, 2022Updated 3 years ago
- This repository contains the source code, models and data files for the work titled: "Unsupervised Image Style Embeddings for Retrieval a…☆13May 29, 2021Updated 4 years ago
- Document Image Enhancement with GANs - TPAMI journal☆214Mar 24, 2023Updated 2 years ago
- Official implementation of "Relational Proxies: Emergent Relationships as Fine-Grained Discriminators", NeurIPS 2022.☆14Feb 1, 2025Updated last year
- ☆12Mar 28, 2024Updated last year
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆137Oct 18, 2025Updated 4 months ago
- [COG24] - Official repository of "OfflineMania: A Benchmark Environment for Offline Reinforcement Learning in Racing Games"☆12Jul 15, 2024Updated last year
- A telegram bot that sends you a message when the GPU is in use☆10May 27, 2024Updated last year
- 🔥🔥[NeurIPS2025]Exploring and mitigating semantic hallucinations in scene text perception and reasoning☆26Dec 11, 2025Updated 2 months ago
- Official code of our WACV paper "ECSIC: Epipolar Cross Attention for Stereo Image Compression"☆14Dec 27, 2023Updated 2 years ago
- ☆10Jan 28, 2016Updated 10 years ago
- CLIP-based simple image-text matching baseline for COCO and F30K☆14Sep 16, 2021Updated 4 years ago
- Python code to extract depth and rgb data from rosbag☆14Nov 24, 2022Updated 3 years ago
- A PyTorch implementation of ClipPrompt based on CVPR 2023 paper "CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained…☆18Nov 5, 2023Updated 2 years ago