klassif-ai / react-pdf-ner-annotatorLinks
Annotate entities directly onto a PDF with automatic OCR for scanned PDFs
☆61Updated 2 years ago
Alternatives and similar repositories for react-pdf-ner-annotator
Users that are interested in react-pdf-ner-annotator are comparing it to the libraries listed below
Sorting:
- A web-based document annotation tool, powered by GPT-4☆263Updated last year
- Software that makes labeling PDFs easy.☆418Updated last year
- `pdfstructure` detects, splits and organizes the documents text content into its natural structure as envisioned by the author.☆106Updated last year
- 🏖TagEditor - Annotation tool for spaCy☆192Updated 2 years ago
- Simplify DOCX files to JSON☆248Updated 11 months ago
- A JavaScript library for text annotation☆403Updated last year
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆326Updated last year
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆72Updated this week
- A React component for annotating PDF, powered by PDF.js and RecogitoJS☆61Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 5 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- Write beautifully short contract. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all.☆13Updated 3 years ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆323Updated 2 years ago
- Spacy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)☆55Updated 3 years ago
- Python library to extract tabular data from images and scanned PDFs☆281Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles.☆75Updated 3 years ago
- A python library for extracting text from PDFs without losing the formatting of the PDF content.☆77Updated 3 years ago
- Document Search Engine Tool☆74Updated 2 years ago
- PDF to XML ALTO file converter☆252Updated 3 weeks ago
- ☆376Updated last year
- Table Detection and Extraction Using Deep Learning ( It is built in Python, using Luminoth, TensorFlow<2.0 and Sonnet.)☆198Updated 2 years ago
- NLP Web API for Legal Text☆18Updated 2 years ago
- Document Search Engine project with TF-IDF abd Google universal sentence encoder model☆54Updated 2 years ago
- Custom recipe and utilities for document processing☆199Updated 3 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆395Updated last year
- Semantic Segmentation of Legal texts that labels sentences with one of 7 rhetorical roles.☆77Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆219Updated 7 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- LegalCrawler: A tool for automated scraping of English legal corpora☆56Updated 3 years ago
- Question Answering annotation platform - Plateforme d'annotation☆90Updated 7 months ago