klassif-ai / react-pdf-ner-annotator
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆59Updated last year
Alternatives and similar repositories for react-pdf-ner-annotator:
Users that are interested in react-pdf-ner-annotator are comparing it to the libraries listed below
- Run OCR, extract information from documents and classify them. In addition, annotate documents and build custom NLP and computer vision m…☆61Updated this week
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆63Updated this week
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆102Updated 9 months ago
- A React component for annotating PDF, powered by PDF.js and RecogitoJS☆51Updated 9 months ago
- Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset☆25Updated last year
- Custom recipe and utilities for document processing☆198Updated 2 years ago
- A study implementation of Gmail Smart Compose trained with Keras and used in browser with Tensorflow.js☆26Updated 2 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆174Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆46Updated 5 months ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 4 months ago
- React components for interactively highlighting parts of text.☆135Updated 2 years ago
- Software that makes labeling PDFs easy.☆403Updated 8 months ago
- 🏖TagEditor - Annotation tool for spaCy☆189Updated 2 years ago
- This project aims to extract text from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging t…☆24Updated 7 months ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents.☆25Updated last year
- Docscan is a document scanner. Take a photo of your documents and frame it.☆99Updated 2 months ago
- Annotation layer for pdf.js☆270Updated 3 months ago
- A web-based document annotation tool, powered by GPT-4☆256Updated last year
- Tools for extract figure, table, text, .. from a pdf document.☆32Updated 4 years ago
- Repository for deepdoctection tutorial notebooks☆40Updated last month
- A basic tool that extracts the structure from the PDF files of scientific articles.☆74Updated 3 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆76Updated last year
- Logical structure analysis for visually structured documents☆85Updated 2 years ago
- 🖍️ Highlight text in documents☆99Updated 3 weeks ago
- A python library for extracting text from PDFs without losing the formatting of the PDF content.☆75Updated 3 years ago
- Build Semantic Search with S-BERT and Fine-tune your model in unsupervised way☆58Updated 2 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆66Updated 4 years ago
- Recognition of handwritten text using CRAFT text detection and TrOCR☆25Updated 2 years ago
- Document Layout Analysis☆359Updated 3 weeks ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆26Updated last year