klassif-ai / react-pdf-ner-annotator
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆57 · Updated last year
Related projects
Alternatives and complementary repositories for react-pdf-ner-annotator
- Document Layout Analysis ☆350 · Updated this week
- Run OCR, extract information from documents and classify them. In addition, annotate documents and build custom NLP and computer vision m… ☆62 · Updated last week
- Table Detection and Extraction Using Deep Learning (built in Python, using Luminoth, TensorFlow<2.0 and Sonnet) ☆198 · Updated last year
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images. In modern times, more a… ☆46 · Updated 2 years ago
- Software that makes labeling PDFs easy. ☆391 · Updated 6 months ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets ☆203 · Updated last year
- A React component for annotating PDF, powered by PDF.js and RecogitoJS ☆48 · Updated 7 months ago
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte… ☆181 · Updated 2 years ago
- `pdfstructure` detects, splits and organizes the document's text content into its natural structure as envisioned by the author. ☆101 · Updated 7 months ago
- A JavaScript library for text annotation ☆369 · Updated 7 months ago
- ☆10 · Updated 2 years ago
- Parsing PDF tables using YOLOv3 ☆114 · Updated 3 years ago
- PDF to XML ALTO file converter ☆216 · Updated 2 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆44 · Updated 3 months ago
- ☆329 · Updated 10 months ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF … ☆65 · Updated 4 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces ☆39 · Updated 3 months ago
- Handwritten text detection in document images using Detectron2 ☆19 · Updated 2 years ago
- Google Colab Demo of CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents ☆46 · Updated 3 years ago
- Repository for deepdoctection tutorial notebooks ☆39 · Updated 4 months ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification ☆173 · Updated last year
- ☆36 · Updated 4 years ago
- Python library to extract tabular data from images and scanned PDFs ☆264 · Updated 3 months ago
- ☆74 · Updated 2 years ago
- Integrate AI-powered Document Analysis Pipelines ☆62 · Updated this week
- Custom recipe and utilities for document processing ☆198 · Updated 2 years ago
- Detectron2 for Document Layout Analysis ☆185 · Updated 3 months ago
- Docscan is a document scanner. Take a photo of your documents and frame it. ☆95 · Updated last week
- Simple docker deployment of document layout analysis using detectron2 ☆19 · Updated 3 years ago
- Multimodal document analysis ☆160 · Updated 5 months ago