klassif-ai / react-pdf-ner-annotator
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆59Updated last year
Alternatives and similar repositories for react-pdf-ner-annotator:
Users that are interested in react-pdf-ner-annotator are comparing it to the libraries listed below
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆67Updated 3 weeks ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆66Updated 4 years ago
- Document Layout Analysis☆368Updated last week
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆103Updated last year
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images In modern times, more a…☆57Updated 2 years ago
- Software that makes labeling PDFs easy.☆410Updated 11 months ago
- Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)☆198Updated 2 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆50Updated last month
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated last week
- A web-based document annotation tool, powered by GPT-4☆259Updated last year
- DocILE: Document Information Localization and Extraction Benchmark☆125Updated 11 months ago
- Parsing pdf tables using YOLOV3☆116Updated 4 years ago
- ☆79Updated 3 years ago
- Recognition of handwritten text using CRAFT text detection and TrOCR☆26Updated 2 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆210Updated last year
- Document Search Engine Tool☆73Updated 2 years ago
- Annotation layer for pdf.js☆280Updated 7 months ago
- A React component for annotating PDF, powered by PDF.js and RecogitoJS☆60Updated last year
- Public runnable examples of using John Snow Labs' OCR for Apache Spark.☆90Updated 2 weeks ago
- Table Detection using Deep Learning☆26Updated 3 years ago
- ☆359Updated last year
- BoxDetect is a Python package based on OpenCV which allows you to easily detect rectangular shapes like character or checkbox boxes on sc…☆109Updated 2 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆48Updated 3 years ago
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js☆26Updated last year
- Tools for extract figure, table, text, .. from a pdf document.☆32Updated 4 years ago
- Simple docker deployment of document layout analysis using detectron2☆19Updated 3 years ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆27Updated 2 years ago
- LLM Based OCR and Document Parsing for Node.js☆103Updated 7 months ago
- Handwritten text detection in document images using Detectron2☆20Updated 3 years ago
- 🏖TagEditor - Annotation tool for spaCy☆192Updated 2 years ago