klassif-ai / react-pdf-ner-annotatorLinks
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆59Updated 2 years ago
Alternatives and similar repositories for react-pdf-ner-annotator
Users that are interested in react-pdf-ner-annotator are comparing it to the libraries listed below
Sorting:
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆104Updated last year
- Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)☆197Updated 2 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆70Updated last week
- Document Layout Analysis☆376Updated 2 weeks ago
- Handwritten text detection in document images using Detectron2☆20Updated 3 years ago
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images In modern times, more a…☆58Updated 3 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆211Updated last year
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js☆27Updated last year
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆68Updated 4 years ago
- Docscan is a document scanner. Take a photo of your documents and frame it.☆103Updated 7 months ago
- Logical structure analysis for visually structured documents☆90Updated 2 years ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆206Updated 5 months ago
- Recognition of handwritten text using CRAFT text detection and TrOCR☆26Updated 2 years ago
- Page to PAGE Layout Analysis Tool☆191Updated 3 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated last month
- Tools for evaluating OCR performance relative to ground truth.☆10Updated last year
- A study implementation of Gmail Smart Compose trained with Keras and used in browser with Tensorflow.js☆28Updated 2 years ago
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆48Updated 2 years ago
- ☆80Updated 3 years ago
- A React component for annotating PDF, powered by PDF.js and RecogitoJS☆62Updated last year
- This repository contains a notebook to demonstrate the power of Document Text Recognition (DocTR) library☆13Updated 3 years ago
- A line-based framework to detect and extract tabular data in JSON format from raster images using computer vision and Tesseract OCR.☆57Updated last year
- This PyTorch implementation of LayoutLM paper by Microsoft demonstrate the SequenceClassfication task using HuggingFaceTransformers to cl…☆34Updated 2 years ago
- Simple docker deployment of document layout analysis using detectron2☆19Updated 3 years ago
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆106Updated 4 years ago
- ☆370Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆51Updated 3 months ago
- ☆22Updated last year
- Detectron2 for Document Layout Analysis☆187Updated 10 months ago
- H&M Fashion Image similarity search with Weaviate and DocArray☆43Updated last year