martinreynaert / TICCLView external linksLinks
Text-Induced Corpus Clean-up
☆20Jun 20, 2023Updated 2 years ago
Alternatives and similar repositories for TICCL
Users that are interested in TICCL are comparing it to the libraries listed below
Sorting:
- Tools for TICCL☆14Dec 12, 2025Updated 2 months ago
- ☆11Updated this week
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 5 years ago
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 7 years ago
- ☆16Jan 7, 2026Updated last month
- FoLiA library for C++☆17Dec 11, 2025Updated 2 months ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Jul 2, 2021Updated 4 years ago
- The CIS OCR PostCorrectionTool☆44Nov 7, 2022Updated 3 years ago
- Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100)☆18May 29, 2022Updated 3 years ago
- View HOCR files with Mirador☆29Sep 27, 2017Updated 8 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Sep 7, 2022Updated 3 years ago
- Web service for creating and hosting IIIF manifests from METS/MODS documents☆36Dec 8, 2022Updated 3 years ago
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)☆23Feb 11, 2022Updated 4 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 2 months ago
- Project to digitize avant-garde periodicals☆12May 13, 2022Updated 3 years ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Mar 13, 2019Updated 6 years ago
- Project between GitHub, figshare and Mozilla Science Lab.☆67Jul 19, 2019Updated 6 years ago
- Digitale Geisteswissenschaften rund um Graphentechnologien☆10Updated this week
- An implementation of the TEI Simple ODD extensions for processing models in XQuery.☆22Jul 24, 2019Updated 6 years ago
- ☆11Dec 31, 2020Updated 5 years ago
- A structured list of text corpora, created for use with a corpus downloader.☆13Aug 27, 2016Updated 9 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Dec 8, 2022Updated 3 years ago
- Modeling and visualizing physical manuscript collation☆53Sep 7, 2022Updated 3 years ago
- Named entity annotation tool☆28Jul 6, 2023Updated 2 years ago
- Docker containers for running VIVO☆13Oct 26, 2016Updated 9 years ago
- interactive, customizable semantic web visualization☆15Dec 27, 2025Updated last month
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Mar 27, 2023Updated 2 years ago
- An experimental Python server for scholarly web annotations☆12Sep 8, 2021Updated 4 years ago
- Rails application supporting the creation of OCR and the IIIF Content Search API☆34Dec 14, 2022Updated 3 years ago
- Django web application to display, annotate, and export digitized books.☆33Feb 3, 2026Updated last week
- An attempt to document the API between Noxon iRadio devices and the servers at vtuner.com and my-noxon.net☆13Sep 8, 2022Updated 3 years ago
- Script for generating a new Pleiades+ CSV file☆16Mar 3, 2020Updated 5 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Nov 3, 2017Updated 8 years ago
- Scripts and configuration for converting MARC bibliographic records into RDF☆32Jun 5, 2025Updated 8 months ago
- T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf☆19May 28, 2025Updated 8 months ago
- A simple Java application for managing an OAI-PMH harvesting workflow☆14Jan 18, 2026Updated 3 weeks ago
- 'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata☆13Jan 13, 2016Updated 10 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- ☆13May 16, 2019Updated 6 years ago