klassif-ai / react-pdf-ner-annotatorLinks
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆62Updated 2 years ago
Alternatives and similar repositories for react-pdf-ner-annotator
Users that are interested in react-pdf-ner-annotator are comparing it to the libraries listed below
Sorting:
- Software that makes labeling PDFs easy.☆420Updated last year
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆107Updated last year
- A web-based document annotation tool, powered by GPT-4☆264Updated last year
- Effortlessly extract information from unstructured data with this library, utilizing advanced AI techniques. Compose AI in customizable p…☆86Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 7 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆74Updated this week
- 🏖TagEditor - Annotation tool for spaCy☆192Updated 3 years ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆329Updated 2 years ago
- ☆385Updated last year
- Fully working applications that demonstrate how to use Haystack to implement various use cases☆134Updated 3 weeks ago
- Logical structure analysis for visually structured documents☆92Updated 3 years ago
- Repository for deepdoctection tutorial notebooks☆45Updated 4 months ago
- multimodal document analysis☆166Updated last year
- A React component for annotating PDF, powered by PDF.js and RecogitoJS☆63Updated last year
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆215Updated 2 years ago
- Document Layout Analysis☆391Updated this week
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆402Updated last year
- A line-based framework to detect and extract tabular data in JSON format from raster images using computer vision and Tesseract OCR.☆57Updated last month
- Viewer for the structure extracted by Grobid on PDF documents☆54Updated last month
- Document Search Engine Tool☆74Updated 2 years ago
- A python library for extracting text from PDFs without losing the formatting of the PDF content.☆78Updated 3 years ago
- 🖍️ Highlight text in documents☆109Updated 6 months ago
- Handwritten text detection in document images using Detectron2☆21Updated 3 years ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆29Updated 2 years ago
- Custom recipe and utilities for document processing☆200Updated 3 years ago
- PDF to XML ALTO file converter☆254Updated this week
- Open source no-code system for text annotation and building of text classifiers☆269Updated 5 months ago
- PDF text data extraction web app with OCR for scanned documents☆91Updated last year
- Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Updated 3 years ago
- Label data using HuggingFace's transformers and automatically get a prediction service☆193Updated 2 years ago