klassif-ai / react-pdf-ner-annotatorLinks
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆61Updated 2 years ago
Alternatives and similar repositories for react-pdf-ner-annotator
Users that are interested in react-pdf-ner-annotator are comparing it to the libraries listed below
Sorting:
- Software that makes labeling PDFs easy.☆420Updated last year
- A web-based document annotation tool, powered by GPT-4☆263Updated last year
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆107Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles.☆75Updated 3 years ago
- Simplify DOCX files to JSON☆251Updated last year
- A React component for annotating PDF, powered by PDF.js and RecogitoJS☆62Updated last year
- 🏖TagEditor - Annotation tool for spaCy☆192Updated 3 years ago
- an extensible tool to generate hyperlinks from legal citations☆36Updated 11 months ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆327Updated last year
- A JavaScript library for text annotation☆406Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 6 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆73Updated this week
- multimodal document analysis☆166Updated last year
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆179Updated 2 years ago
- Python library to extract tabular data from images and scanned PDFs☆282Updated last year
- EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-lea…☆56Updated 7 months ago
- ☆64Updated last year
- Logical structure analysis for visually structured documents☆91Updated 3 years ago
- Multiple and Large PDF Documents Text Extraction.☆131Updated 7 months ago
- Ergonomic line-by-line transcription of scanned text.☆53Updated 4 years ago
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆107Updated 4 years ago
- Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Updated 3 years ago
- Fully working applications that demonstrate how to use Haystack to implement various use cases☆130Updated last week
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆77Updated last year
- Annotation layer for pdf.js☆288Updated last year
- Question Answering annotation platform - Plateforme d'annotation☆90Updated 8 months ago
- Spacy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)☆55Updated 3 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆396Updated last year
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆69Updated 4 years ago
- PDF to XML ALTO file converter☆254Updated 2 weeks ago