instituutnederlandsetaal / OpenConvert
Text conversion tool (from e.g. Word, HTML, txt) to corpus formats (TEI or FoLiA)
☆23Updated 3 years ago
Alternatives and similar repositories for OpenConvert
Users who are interested in OpenConvert are comparing it to the libraries listed below.
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 4 years ago
- Text-Induced Corpus Clean-up☆20Updated 2 years ago
- An implementation of the TEI Simple ODD extensions for processing models in XQuery.☆22Updated 6 years ago
- Named entity annotation tool☆28Updated 2 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆57Updated last month
- Automatic text comparison with an extendable variance classifier☆12Updated 2 years ago
- Named Entity Recognition tool for Europeana Newspapers☆14Updated 7 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆61Updated 7 years ago
- The CIS OCR PostCorrectionTool☆44Updated 2 years ago
- Kiln is a multi-platform framework for building and deploying complex websites whose source content is primarily in XML. It brings togeth…☆34Updated 3 years ago
- CollateX – Software for Collating Textual Sources☆96Updated last year
- A repository for illustrating the transformation of a PAGE XML file into XML-TEI format, resulting from experimentations made for the LEC…☆16Updated 3 years ago
- Tools for TICCL☆14Updated last month
- An experimental Python server for scholarly web annotations☆12Updated 4 years ago
- ☆11Updated 5 years ago
- The original OxGarage is deprecated and replaced by TEIGarage☆20Updated 3 years ago
- Python tools for performing various operations on ALTO XML files☆48Updated 8 months ago
- The main TEI Publisher app☆78Updated this week
- High-performance text aligner for large collections of texts☆53Updated this week
- Web application to build XML stand-off markup☆15Updated 4 years ago
- Conversions between various OCR formats☆81Updated 2 years ago
- ☆25Updated 4 years ago
- Transformation, web frontend, and API for lobid-organisations☆13Updated 2 months ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 3 years ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 6 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated 2 years ago
- Hymir is a Java based IIIF Server. It is based on "IIIF Image API Java Libraries" and "IIIF Presentation API Java Libraries" projects (se…☆31Updated this week
- Web application for transcribing OCR ground truth from Archive.org☆17Updated 7 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Updated 3 years ago