text2knowledge / docTR-LabelerLinks
A OCR labeling tool - made for docTR
☆15Updated this week
Alternatives and similar repositories for docTR-Labeler
Users that are interested in docTR-Labeler are comparing it to the libraries listed below
Sorting:
- A Python library to extract tabular data from PDFs☆66Updated 5 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆14Updated last year
- A spaCy wrapper for GliNER☆117Updated 7 months ago
- A purely-functional HTML builder for Python. Think JSX rather than templates.☆100Updated 8 months ago
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR☆147Updated 3 weeks ago
- A fun party trick to run Python code from another venv into this one.☆203Updated 5 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆146Updated last month
- A Jupyter widget for annotating images with bounding boxes☆136Updated last year
- Python bindings to PDFium, reasonably cross-platform.☆633Updated this week
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆18Updated last year
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆143Updated 3 months ago
- Generalist and Lightweight Model for Text Classification☆156Updated 2 months ago
- A YAML parser with advanced functionalities to ease your application configuration☆37Updated this week
- Confection: the sweetest config system for Python☆190Updated 5 months ago
- 🦦 weasel: A small and easy workflow system☆85Updated last year
- ☆23Updated last year
- An easy way to chunk spaCy docs.☆22Updated last year
- ☆83Updated 3 months ago
- A component orchestration engine☆28Updated last year
- Pydantic extension for annotating autocorrecting fields.☆222Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Small python package to measure OCR quality and other related metrics.☆25Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆347Updated 5 months ago
- Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"☆14Updated last year
- A bit of extra usability for sqlite☆210Updated 2 months ago
- Recognition of handwritten text using CRAFT text detection and TrOCR☆26Updated 2 years ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text)☆238Updated 3 months ago
- Document Layout Analysis☆383Updated last week
- 🔢 Work with static vector models☆29Updated 4 months ago
- A Python implementation of Lunr.js 🌖☆199Updated 6 months ago