klassif-ai / react-pdf-ner-annotator
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆56Updated last year
Related projects: ⓘ
- Run OCR, extract information from documents and classify them. In addition, annotate documents and build custom NLP and computer vision m…☆60Updated this week
- Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)☆199Updated last year
- Software that makes labeling PDFs easy.☆383Updated 4 months ago
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆97Updated 5 months ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆41Updated last month
- TableNet: Deep Learning model for end-to-end Table Detection and Tabular data extraction from Scanned Data Images In modern times, more a…☆42Updated 2 years ago
- 🏖TagEditor - Annotation tool for spaCy☆185Updated last year
- A web-based document annotation tool, powered by GPT-4☆243Updated 8 months ago
- Simple docker deployment of document layout analysis using detectron2☆20Updated 2 years ago
- The scripts for training Detectron2-based Layout Models on popular layout analysis datasets☆200Updated 11 months ago
- A python library for extracting text from PDFs without losing the formatting of the PDF content.☆72Updated 2 years ago
- A project about benchmarking and evaluating existing PDF extraction tools on their semantic abilities to extract the body texts from PDF …☆63Updated 3 years ago
- Document Layout Analysis☆335Updated this week
- A spaCy wrapper for GliNER☆77Updated 2 months ago
- Information extraction from English and German texts based on predicate logic☆133Updated last year
- multimodal document analysis☆159Updated 3 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆71Updated last year
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆166Updated last year
- Pipeline for converting PDFs to raw text with PaddleOCR☆20Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆208Updated 3 months ago
- Logical structure analysis for visually structured documents☆80Updated 2 years ago
- Spacy NER annotator using ipywidgets☆120Updated 5 months ago
- ☆316Updated 8 months ago
- 🦦 weasel: A small and easy workflow system☆63Updated 2 months ago
- Build Semantic Search with S-BERT and Fine-tune your model in unsupervised way☆57Updated 2 years ago
- Label data using HuggingFace's transformers and automatically get a prediction service☆175Updated last year
- semantically distinct key phrase extraction using hilbert hashes.☆46Updated 2 years ago
- A Dataset of German Legal Documents for Named Entity Recognition☆156Updated last year
- Repository for deepdoctection tutorial notebooks☆36Updated last month
- Integrate AI-powered Document Analysis Pipelines☆58Updated last week