maiaPhilippe / pdf-to-textLinks
PDF OCR using Pure Javascript by tesseract.js api
☆21Updated 7 years ago
Alternatives and similar repositories for pdf-to-text
Users that are interested in pdf-to-text are comparing it to the libraries listed below
Sorting:
- brings autocomplete to Quill Placeholder module☆11Updated 6 years ago
- ☆13Updated last year
- Annotate entities directly onto a PDF with automatic OCR for scanned PDFs☆59Updated 2 years ago
- Annotation layer for pdf.js☆285Updated 10 months ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.☆233Updated this week
- Find legal citations in any block of text☆160Updated 3 weeks ago
- Web based JavaScript GUI library for proofreading/editing hOCR☆95Updated 6 years ago
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆106Updated 4 years ago
- Normalized dataset of 70k job titles☆70Updated last year
- A React component for annotating PDF, powered by PDF.js and RecogitoJS☆61Updated last year
- A JavaScript library for text annotation☆401Updated last year
- Ancient Greek language models for spaCy☆31Updated 4 months ago
- ☆12Updated last year
- ☆90Updated last year
- Wrapper for PDF JS to add annotations☆366Updated 3 years ago
- Images of example pages from Transkribus model training sets to make it easier to find a match.☆13Updated 3 years ago
- 📚 Materials for Advanced Legal Analytics (LAW3027) @ Maastricht University.☆13Updated last year
- Ground Truth Resources for the HTR of patrimonial documents☆44Updated this week
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆323Updated last year
- A simple SCORM compliant wrapper that will enable you to run your own web based content in any LMS.☆55Updated 7 years ago
- Detect and align similar passages☆106Updated 2 months ago
- Annotato is a React component that helps to annotate or display and add interactivity to previously made annotations in a given text.☆12Updated last year
- Arethusa: Annotation Environment☆36Updated 2 years ago
- Javascript library for creating annotations in PDF documents☆602Updated 2 years ago
- Conversions between various OCR formats☆79Updated 2 years ago
- Software that makes labeling PDFs easy.☆416Updated last year
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated last year
- Encoding the Bible in TEI, starting with the Gospels☆24Updated 2 years ago
- 🏖TagEditor - Annotation tool for spaCy☆192Updated 2 years ago
- An open tool for designing, building and managing interactive dialog systems☆268Updated 2 years ago