maiaPhilippe / pdf-to-text
PDF OCR in pure JavaScript using the tesseract.js API
☆21 Updated 7 years ago
Alternatives and similar repositories for pdf-to-text
Users who are interested in pdf-to-text are comparing it to the libraries listed below.
- Script that sets up and configures an entire CQPweb server installation ☆11 Updated 5 years ago
- Annotation layer for pdf.js ☆282 Updated 7 months ago
- Working with hOCR in Javascript ☆127 Updated 2 years ago
- File format, model, API, and apps for manipulating text and its annotated features ☆70 Updated 2 weeks ago
- I created this repository to provide the DH Community a compilation of free, open-source tools for creating and developing digital humani… ☆36 Updated last year
- Annotate entities directly onto a PDF with automatic OCR for scanned PDFs ☆59 Updated last year
- Arethusa: Annotation Environment ☆35 Updated 2 years ago
- Ergonomic line-by-line transcription of scanned text. ☆51 Updated 4 years ago
- Receipt scanner extracts information from your PDF or image receipts - built in NodeJS ☆299 Updated 6 years ago
- An online annotation platform for teaching and learning in the humanities. ☆108 Updated 3 months ago
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf. ☆106 Updated 4 years ago
- Extract case law citations with Node ☆57 Updated 11 years ago
- Ancient Greek language models for spaCy ☆29 Updated 2 months ago
- Yet another search platform for linguistic corpora. ☆25 Updated this week
- Data Store for Annotation Studio ☆46 Updated 2 years ago
- Annotation layer for PDF.js. Forked and modified from Submitty's branch. ☆15 Updated 2 years ago
- guides and test data for OCR4all ☆30 Updated 2 years ago
- Find legal citations in any block of text ☆150 Updated this week
- A tool to help quickly generate draft interviews from an existing document (pdf or DOCX) for the docassemble platform. ☆23 Updated 10 months ago
- Quickly go from a paper court form to a runnable, guided, step-by-step web application powered by Docassemble. Swap out branding and pre-… ☆49 Updated this week
- ☆32 Updated 2 years ago
- Pipeline for the production of digital scholarly editions of archival collections ☆12 Updated last year
- Tool to OCR PDFs using Google Cloud Vision ☆42 Updated 2 years ago
- ☆92 Updated this week
- 🏖TagEditor - Annotation tool for spaCy ☆193 Updated 2 years ago
- Get annotation suggestions for the INCEpTION text annotation platform from spaCy, Sentence BERT, scikit-learn and more. Runs as a web-ser… ☆45 Updated 7 months ago
- CollectionBuilder-CSV is a "stand alone" template for creating digital collection and exhibit websites using Jekyll and a metadata CSV. ☆26 Updated 2 weeks ago
- Easily build and maintain any kind of contract. Free and Open Source ☆94 Updated 7 years ago
- PDF.js + Hypothesis viewer / annotator ☆388 Updated 3 months ago
- Rezonator: Dynamics of human engagement ☆35 Updated 6 months ago