klassif-ai / react-pdf-ner-annotator
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆59Updated last year
Alternatives and similar repositories for react-pdf-ner-annotator:
Users that are interested in react-pdf-ner-annotator are comparing it to the libraries listed below
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆103Updated 11 months ago
- A React component for annotating PDF, powered by PDF.js and RecogitoJS☆59Updated 11 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆48Updated last week
- Software that makes labeling PDFs easy.☆406Updated 10 months ago
- Document Layout Analysis☆360Updated this week
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 2 weeks ago
- Annotation layer for pdf.js☆278Updated 6 months ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆66Updated 4 years ago
- A python library for extracting text from PDFs without losing the formatting of the PDF content.☆77Updated 3 years ago
- Custom recipe and utilities for document processing☆199Updated 2 years ago
- Handwritten text detection in document images using Detectron2☆20Updated 3 years ago
- React components for interactively highlighting parts of text.☆137Updated 2 years ago
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images In modern times, more a…☆56Updated 2 years ago
- Simple docker deployment of document layout analysis using detectron2☆19Updated 3 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆207Updated last year
- Pipeline for converting PDFs to raw text with PaddleOCR☆21Updated last year
- Ergonomic line-by-line transcription of scanned text.☆51Updated 4 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆67Updated this week
- A study implementation of Gmail Smart Compose trained with Keras and used in browser with Tensorflow.js☆26Updated 2 years ago
- ☆22Updated last year
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆195Updated 2 months ago
- 🏖TagEditor - Annotation tool for spaCy☆192Updated 2 years ago
- Semantic Search + Keyword Search + Hybrid Search + Filtering + Faceting on 300K HN Comments☆49Updated 3 months ago
- 🚀GUI for training spaCy models☆54Updated 3 years ago
- ☆10Updated 3 years ago
- PDF to XML ALTO file converter☆233Updated last week
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆48Updated last year
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆27Updated last year
- Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)☆198Updated 2 years ago
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js☆26Updated last year