ultrasaurus / hocr-javascript
JS for overlaying OCR on image using HOCR formatted HTML
☆23Updated 8 years ago
Related projects: ⓘ
- View HOCR files with Mirador☆26Updated 6 years ago
- generic JSON-LD editor☆32Updated 7 years ago
- Tools for TICCL☆14Updated last week
- Process, enhance and evaluate multiple OCR output.☆20Updated 5 years ago
- API implementation, User Interface, and more modules of the IPTC EXTRA project☆12Updated 2 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆34Updated last year
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Updated last month
- Named Entity Recognition tool for Europeana Newspapers☆14Updated 6 years ago
- A Python wrapper for the nascent hypothes.is web API☆11Updated 6 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 7 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆51Updated 2 months ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 5 years ago
- Web-based page layout editor created for EMOP (Early Modern OCR Project).☆11Updated 3 years ago
- Crop And Splice Segments (of scanned pages)☆14Updated 5 years ago
- LevelGraph.io Playground☆11Updated 2 years ago
- Application to turn SPARQL queries into APIs and use them in a simple Web app (Express + Vue)☆9Updated 2 years ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 4 years ago
- Create GraphQL schemas from RDF ontologies☆28Updated 5 years ago
- Ergonomic line-by-line transcription of scanned text.☆47Updated 3 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Updated 3 years ago
- Wrapper for the kraken OCR engine☆10Updated 3 weeks ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- A highly adaptable HTML5 only component that enables interactive access and search data from a SPARQL endpoint.☆10Updated 11 months ago
- Specification of the extraction/transformation of Microdata content to RDF☆15Updated 3 years ago
- Working with hOCR in Javascript☆119Updated last year
- Repository for GitDOX, a GitHub Data-storage Online XML editor☆15Updated 5 months ago
- A thin GraphQL wrapper around spacy☆21Updated 4 years ago
- Named entity annotation tool☆27Updated last year
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆24Updated 2 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆37Updated 6 months ago