text2knowledge / docTR-Labeler
An OCR labeling tool - made for docTR
☆15Updated 3 weeks ago
Alternatives and similar repositories for docTR-Labeler
Users that are interested in docTR-Labeler are comparing it to the libraries listed below
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR☆154Updated 3 weeks ago
- A Python library to extract tabular data from PDFs☆66Updated 5 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆15Updated last year
- UniTable: Towards a Unified Table Foundation Model☆506Updated last year
- ☆71Updated last week
- This repo works as a sandbox environment for htrflow.☆36Updated 6 months ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆145Updated 4 months ago
- Recognition of handwritten text using CRAFT text detection and TrOCR☆26Updated 2 years ago
- Layout analysis to find layout elements in documents (similar to P2PaLA)☆19Updated 3 weeks ago
- Library used to deskew a scanned document☆485Updated this week
- A fun party trick to run Python code from another venv into this one.☆203Updated 6 months ago
- Document Layout Analysis☆386Updated this week
- 🔢 Work with static vector models☆30Updated 5 months ago
- Page-wise text recognition with lower-supervision line data models☆46Updated 2 weeks ago
- HTRflow is the underlying engine for our HTR-pipeline☆62Updated 3 months ago
- Clean, filter and sample URLs to optimize data collection – Python & command-line – Deduplication, spam, content and language filters☆148Updated 9 months ago
- A YAML parser with advanced functionalities to ease your application configuration☆37Updated last week
- A visual labeling system implemented in Jupyter widgets.☆154Updated 10 months ago
- Generalist and Lightweight Model for Text Classification☆161Updated 3 months ago
- Confection: the sweetest config system for Python☆190Updated 5 months ago
- Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.☆12Updated last year
- A spaCy wrapper for GliNER☆121Updated 8 months ago
- Layout Analysis Dataset with Segmonto (LADaS)☆21Updated 2 months ago
- Python bindings to PDFium, reasonably cross-platform.☆647Updated this week
- Train huggingface models on top of Prodigy annotations☆21Updated last year
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.☆131Updated 2 weeks ago
- An OCR evaluation tool☆67Updated last month
- A Python implementation of Lunr.js 🌖☆200Updated 6 months ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆219Updated 8 months ago
- ☆21Updated 2 years ago