Handwritten Text Recognition and Character Detection
☆165Sep 28, 2025Updated 5 months ago
Alternatives and similar repositories for DTLR
Users who are interested in DTLR are comparing it to the repositories listed below
- (EarthVision 2025 - CVPR Workshop) Official repository of DAFA-LS, a dataset of satellite image time series for the task of archaeologica…☆38Nov 21, 2024Updated last year
- The Learnable Typewriter: A Generative Approach to Text Line Analysis☆34Oct 31, 2024Updated last year
- Reliability in Semantic Segmentation: Can We Use Synthetic Data? (ECCV 2024)☆41Jul 17, 2024Updated last year
- Multi-Camera Hand-Eye Calibration Framework for calibrating a camera network with respect to a robot arm☆32Jan 21, 2026Updated last month
- Implementation of the multi-temporal UTAE for the task of satellite image time series semantic change detection (SITS-SCD)☆60Jul 11, 2024Updated last year
- Historical Diagram Vectorization☆19Nov 25, 2025Updated 3 months ago
- Official implementation of the Polynomial Mixer☆23Sep 15, 2025Updated 5 months ago
- ☆77Oct 25, 2024Updated last year
- Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"☆88Jun 6, 2025Updated 8 months ago
- Code for "Don’t drop your samples! Coherence-aware training benefits Conditional diffusion" CVPR 2024 Highlight☆57Jul 24, 2025Updated 7 months ago
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paper☆95Nov 13, 2025Updated 3 months ago
- [AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documents☆105Jul 15, 2025Updated 7 months ago
- ☆90Oct 24, 2024Updated last year
- Basic HTR concepts/modules to boost performance☆39Nov 30, 2024Updated last year
- ☆14Oct 7, 2021Updated 4 years ago
- Toolbox for the Earth Parser Dataset, a dataset presented in the "Learnable Earth Parser: Discovering 3D Prototypes in Aerial Scans" pape…☆26Aug 23, 2023Updated 2 years ago
- ☆19Oct 1, 2021Updated 4 years ago
- Official PyTorch implementation of the "A Model You Can Hear: Audio Identification with Playable Prototypes" paper☆37Aug 8, 2022Updated 3 years ago
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregator☆12Apr 28, 2024Updated last year
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- ☆17Jul 9, 2024Updated last year
- (ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper☆88May 25, 2023Updated 2 years ago
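- Corrects document distortion, blur, shadows, and similar defects, using ONNX models for simple, lightweight deployment; will keep following the latest and best document rectification methods and models. Correct document distortion using a lightweight ONNX model for easy deployment. We wi…☆95Dec 17, 2025Updated 2 months ago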
- Computer vision platform for the Digital Humanities☆27Updated this week
- Official PyTorch Implementation of "Rethinking HTG Evaluation: Bridging Generation and Recognition" (Oral) - 1st Workshop on Critical Eva…☆17Sep 23, 2024Updated last year
- High order Moment Models☆41Nov 13, 2025Updated 3 months ago
- SPRINT: Script-agnostic Structure Recognition in Tables☆16Mar 26, 2025Updated 11 months ago
- A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition☆198Feb 11, 2026Updated 2 weeks ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆312Aug 15, 2025Updated 6 months ago
- PyTorch implementation of "Representing Shape Collections with Alignment-Aware Linear Models" paper.☆30May 27, 2022Updated 3 years ago
- This repository contains content related to 2D and 3D lane detection, as well as video lane detection. There are not only papers here, bu…☆13Sep 1, 2024Updated last year
- (IGARSS 2025) Prototype-based method for agricultural image time series classification.☆45Sep 5, 2024Updated last year
- ☆18Sep 23, 2025Updated 5 months ago
- Code for SCAM! Transferring humans between images with Semantic Cross Attention Modulation. Also contains implementation for SPADE, CLADE…☆56Nov 8, 2022Updated 3 years ago
- The official code for the CVPR 2024 paper: Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer☆55Jun 14, 2024Updated last year
- Utility functions for CIL☆20Jun 18, 2024Updated last year
- (CVPRW 2022) Learning Co-segmentation by Segment Swapping for Retrieval and Discovery☆53Oct 27, 2022Updated 3 years ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,017Apr 14, 2025Updated 10 months ago
- Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)☆67Jun 6, 2024Updated last year