instituutnederlandsetaal / OpenConvertLinks
Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)
☆23Updated 3 years ago
Alternatives and similar repositories for OpenConvert
Users that are interested in OpenConvert are comparing it to the libraries listed below
Sorting:
- Named Entity Recognition tool for Europeana Newspapers☆14Updated 7 years ago
- Kiln is a multi-platform framework for building and deploying complex websites whose source content is primarily in XML. It brings togeth…☆34Updated 3 years ago
- An experimental Python server for scholarly web annotations☆12Updated 3 years ago
- The original OxGarage is deprecated and replaced by TEIGarage☆20Updated 2 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆55Updated 2 months ago
- Repository for GitDOX, a GitHub Data-storage Online XML editor☆16Updated last month
- An implementation of the TEI Simple ODD extensions for processing models in XQuery.☆22Updated 6 years ago
- Text-Induced Corpus Clean-up☆20Updated 2 years ago
- Named entity annotation tool☆28Updated 2 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Transformation, web frontend, and API for lobid-organisations☆14Updated 5 months ago
- Awesome AI in Libraries☆16Updated 2 years ago
- OxGarage is an web, and RESTful, service to manage the transformation of documents between a variety of formats. The majority of transfor…☆53Updated 9 years ago
- An ontology for Linked Ancient World Data☆34Updated 9 years ago
- Automatic text comparison with an extendable variance classifier☆12Updated last year
- The main TEI Publisher app☆74Updated last week
- A module for Omeka S that provides an API for the Neatline 3 single page application☆15Updated 2 years ago
- Tools for TICCL☆14Updated last month
- Data space of the DARIAH Lexical Resources Working Group☆21Updated last month
- Digitale Geisteswissenschaften rund um Graphentechnologien☆8Updated 2 weeks ago
- ☆25Updated 4 years ago
- Metadata Quality Assessment Framework API☆18Updated this week
- Extract the MODS/ALTO metadata of a bunch of METS/ALTO files into pandas DataFrames for data analysis☆12Updated this week
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- ☆23Updated 7 years ago
- Web application to build XML stand-off markup☆15Updated 4 years ago
- ☆13Updated 2 weeks ago
- Python tools for performing various operations on ALTO XML files☆48Updated 5 months ago
- The CIS OCR PostCorrectionTool☆43Updated 2 years ago