LanguageMachines / ticcltoolsLinks
Tools for TICCL
☆14Updated last month
Alternatives and similar repositories for ticcltools
Users that are interested in ticcltools are comparing it to the libraries listed below
Sorting:
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Updated 2 years ago
- Text-Induced Corpus Clean-up☆20Updated 2 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Updated 4 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆65Updated last year
- Named Entity Recognition tool for Europeana Newspapers☆14Updated 7 years ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- Parser for KAF NAF files written in Python☆16Updated 4 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆60Updated 7 years ago
- ☆14Updated 3 years ago
- Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100)☆16Updated 3 years ago
- A web-based, token-level annotation tool for non-standard language data☆10Updated 4 years ago
- ☆13Updated last month
- Process, enhance and evaluate multiple OCR output.☆22Updated 8 months ago
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images.☆24Updated 9 years ago
- Collections of english historical texts and data relating to them☆18Updated 4 years ago
- All ontologies used in NIF 2.0 (NIF-Core + vocabulary modules + helper modules)☆37Updated 8 years ago
- Named entity annotation tool☆28Updated 2 years ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Updated 8 months ago
- Specification of NAF, the NLP annotation format☆21Updated 4 years ago
- A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st…☆24Updated 6 months ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- A simple Java application for managing an OAI-PMH harvesting workflow☆14Updated 2 months ago
- JournalTouch provides a touch-optimized interface for browsing current journal tables of contents in Responsive Design. Fun!☆14Updated 6 years ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 6 years ago
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Updated 2 months ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago
- ☆16Updated 10 years ago
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)☆23Updated 3 years ago