klassif-ai / react-pdf-ner-annotator
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆59 Updated last year
Alternatives and similar repositories for react-pdf-ner-annotator
Users who are interested in react-pdf-ner-annotator are comparing it to the libraries listed below.
- `pdfstructure` detects, splits and organizes the document's text content into its natural structure as envisioned by the author. ☆104 Updated last year
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip… ☆69 Updated last month
- A React component for annotating PDF, powered by PDF.js and RecogitoJS ☆61 Updated last year
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images. In modern times, more a… ☆58 Updated 2 years ago
- Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity. ☆11 Updated last year
- 🏖TagEditor - Annotation tool for spaCy ☆193 Updated 2 years ago
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js ☆27 Updated last year
- Simple docker deployment of document layout analysis using detectron2 ☆19 Updated 3 years ago
- Software that makes labeling PDFs easy. ☆415 Updated last year
- Document Layout Analysis ☆373 Updated this week
- Semantic Search + Keyword Search + Hybrid Search + Filtering + Faceting on 300K HN Comments ☆51 Updated 5 months ago
- Repository for deepdoctection tutorial notebooks ☆45 Updated 5 months ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆66 Updated 4 years ago
- Logical structure analysis for visually structured documents ☆89 Updated 2 years ago
- spaCy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results) ☆54 Updated 2 years ago
- Handwritten text detection in document images using Detectron2 ☆20 Updated 3 years ago
- Pipeline for converting PDFs to raw text with PaddleOCR ☆23 Updated last year
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces ☆39 Updated 2 weeks ago
- A Named Entity Recognition + Entity Linker + Relation Extraction Pipeline built using spaCy v3. Given a text, the pipeline will extract e… ☆39 Updated last year
- Research papers and code on information extraction from image/pdf ☆97 Updated 2 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆50 Updated 2 months ago
- A web-based document annotation tool, powered by GPT-4 ☆260 Updated last year
- ☆10 Updated 3 years ago
- 🚀GUI for training spaCy models ☆54 Updated 4 years ago
- React components for interactively highlighting parts of text. ☆138 Updated 2 years ago
- Boilerplate Removal using Deep Learning ☆82 Updated 3 years ago
- multimodal document analysis ☆164 Updated 11 months ago
- ☆80 Updated 3 years ago
- ☆22 Updated last year
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL) ☆53 Updated last year