martinreynaert / TICCLLinks
Text-Induced Corpus Clean-up
☆20Updated 2 years ago
Alternatives and similar repositories for TICCL
Users that are interested in TICCL are comparing it to the libraries listed below
Sorting:
- Named Entity Recognition tool for Europeana Newspapers☆14Updated 7 years ago
- Tools for TICCL☆14Updated 2 weeks ago
- ☆16Updated 10 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆61Updated 7 years ago
- Named entity annotation tool☆28Updated 2 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆28Updated 4 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Updated 3 years ago
- OCRopus model for Gothic print (Fraktur)☆19Updated 5 years ago
- ☆11Updated 5 years ago
- Process, enhance and evaluate multiple OCR output.☆24Updated 3 weeks ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 6 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆27Updated 3 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Updated 4 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 3 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Updated 2 weeks ago
- ANNotation Infrastructure using Finna: an automatic subject indexing tool using Finna as corpus☆15Updated 7 years ago
- IXA pipes Part of Speech tagger and Lemmatizer (http://ixa2.si.ehu.es/ixa-pipes)☆18Updated 3 years ago
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)☆23Updated 3 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 6 years ago
- a CLI suggestion tool for Wikidata entities☆30Updated 9 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆116Updated 9 years ago
- A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st…☆24Updated 11 months ago
- Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100)☆16Updated 3 years ago
- Repository for the book Among Digitized Manuscripts by L.W. Cornelis van Lit (Leiden: Brill, 2020)☆25Updated 5 years ago
- The CIS OCR PostCorrectionTool☆44Updated 3 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆58Updated 3 months ago
- Web application for transcribing OCR ground truth from Archive.org☆17Updated 7 years ago
- ☆32Updated 3 years ago
- Python API for KB data-services☆19Updated 5 years ago
- Legacy Repository: TEI SimplePrint now merged into TEI Repository. Originally TEI Simple aimed to define a new highly-constrained and pr…☆49Updated 9 years ago