klassif-ai / react-pdf-ner-annotatorLinks
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆61Updated 2 years ago
Alternatives and similar repositories for react-pdf-ner-annotator
Users that are interested in react-pdf-ner-annotator are comparing it to the libraries listed below
Sorting:
- Software that makes labeling PDFs easy.☆425Updated last year
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆105Updated last year
- A web-based document annotation tool, powered by GPT-4☆265Updated 2 years ago
- 🏖TagEditor - Annotation tool for spaCy☆193Updated 3 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆53Updated 9 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆80Updated last week
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆329Updated 2 years ago
- ☆389Updated 2 years ago
- Custom recipe and utilities for document processing☆200Updated 3 years ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆29Updated 2 years ago
- EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-lea…☆59Updated 11 months ago
- Named Entity Recognition (NER) Annotation tool for SpaCy. Generates Traning Data as a JSON which can be readily used.☆591Updated 10 months ago
- Parsing pdf tables using YOLOV3☆121Updated 4 years ago
- Scripts for Medium articles☆61Updated last year
- A python library for extracting text from PDFs without losing the formatting of the PDF content.☆79Updated 4 years ago
- Multiple and Large PDF Documents Text Extraction.☆131Updated 11 months ago
- Fully working applications that demonstrate how to use Haystack to implement various use cases☆135Updated last month
- PDF parser and converter to HTML☆90Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles.☆76Updated 4 years ago
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js☆27Updated 2 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆69Updated 5 years ago
- Simplify DOCX files to JSON☆256Updated last year
- PDF text data extraction web app with OCR for scanned documents☆95Updated last year
- Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Updated 3 years ago
- A tool for converting PDF into hOCR with text, tables, and figures being recognized and preserved.☆458Updated 2 years ago
- Repository for deepdoctection tutorial notebooks☆48Updated last week
- Document Search Engine Tool☆76Updated 3 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Spacy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)☆55Updated 3 years ago
- Semantic Search + Keyword Search + Hybrid Search + Filtering + Faceting on 300K HN Comments☆55Updated last year