klassif-ai / react-pdf-ner-annotator
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆60Updated 2 years ago
Alternatives and similar repositories for react-pdf-ner-annotator
Users who are interested in react-pdf-ner-annotator are comparing it to the libraries listed below
- A web-based document annotation tool, powered by GPT-4☆262Updated last year
- Software that makes labeling PDFs easy.☆418Updated last year
- `pdfstructure` detects, splits and organizes the document's text content into its natural structure as envisioned by the author.☆106Updated last year
- 🏖TagEditor - Annotation tool for spaCy☆192Updated 2 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆69Updated 4 years ago
- A JavaScript library for text annotation☆402Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles.☆74Updated 3 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆70Updated this week
- multimodal document analysis☆165Updated last year
- A React component for annotating PDF, powered by PDF.js and RecogitoJS☆61Updated last year
- Python library to extract tabular data from images and scanned PDFs☆279Updated last year
- Simplify DOCX files to JSON☆246Updated 10 months ago
- Repository for deepdoctection tutorial notebooks☆46Updated last month
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- Parsing PDF tables using YOLOv3☆118Updated 4 years ago
- an extensible tool to generate hyperlinks from legal citations☆34Updated 10 months ago
- A Python library for extracting text from PDFs without losing the formatting of the PDF content.☆77Updated 3 years ago
- 📑 Python Package to reconstruct the original continuous text from PDFs with language models☆32Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- ☆374Updated last year
- Document Layout Analysis☆381Updated 2 weeks ago
- Custom recipe and utilities for document processing☆199Updated 3 years ago
- Table Detection and Extraction Using Deep Learning (It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)☆198Updated 2 years ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆326Updated last year
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images In modern times, more a…☆59Updated 3 years ago
- PDF to XML ALTO file converter☆248Updated this week
- Logical structure analysis for visually structured documents☆91Updated 2 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆213Updated last year
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆395Updated last year
- Ergonomic line-by-line transcription of scanned text.☆53Updated 4 years ago