klassif-ai / react-pdf-ner-annotator
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆61Updated 2 years ago
Alternatives and similar repositories for react-pdf-ner-annotator
Users who are interested in react-pdf-ner-annotator are comparing it to the libraries listed below.
- Software that makes labeling PDFs easy.☆420Updated last year
- A web-based document annotation tool, powered by GPT-4☆264Updated last year
- 🏖TagEditor - Annotation tool for spaCy☆192Updated 3 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆72Updated last week
- Write beautifully short contracts. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Updated 3 years ago
- A python library for extracting text from PDFs without losing the formatting of the PDF content.☆78Updated 3 years ago
- A React component for annotating PDF, powered by PDF.js and RecogitoJS☆62Updated last year
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 5 months ago
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆107Updated last year
- spaCy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)☆55Updated 3 years ago
- Parsing PDF tables using YOLOv3☆118Updated 4 years ago
- Python library to extract tabular data from images and scanned PDFs☆283Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles.☆75Updated 3 years ago
- Document Search Engine Tool☆74Updated 2 years ago
- Document Search Engine project with TF-IDF and Google Universal Sentence Encoder model☆54Updated 2 years ago
- multimodal document analysis☆167Updated last year
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆327Updated 2 years ago
- Simplify DOCX files to JSON☆253Updated last year
- Question Answering annotation platform - Plateforme d'annotation☆90Updated 8 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 7 months ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆69Updated 4 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- ☆63Updated last year
- Custom recipe and utilities for document processing☆200Updated 3 years ago
- Handwritten text detection in document images using Detectron2☆20Updated 3 years ago
- React components for interactively highlighting parts of text.☆138Updated 2 years ago
- ☆384Updated last year
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆180Updated 2 years ago
- PDF to XML ALTO file converter☆253Updated last month
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images In modern times, more a…☆61Updated 3 years ago