LanguageMachines / ticcltools
Tools for TICCL
☆14Updated last month
Related projects ⓘ
Alternatives and complementary repositories for ticcltools
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Updated 3 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- Text-Induced Corpus Clean-up☆20Updated last year
- Parser for KAF NAF files written in Python☆16Updated 3 years ago
- Named Entity Recognition tool for Europeana Newspapers☆14Updated 6 years ago
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13Updated last month
- OCRopus model for Gothic print (Fraktur)☆18Updated 4 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆24Updated 2 years ago
- Process, enhance and evaluate multiple OCR output.☆20Updated 2 weeks ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 8 years ago
- Collections of english historical texts and data relating to them☆18Updated 3 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Updated 3 weeks ago
- 🕸 YALC: Yet Another LOD Cloud (registry of Linked Open Datasets).☆14Updated last year
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆60Updated 5 months ago
- A web-based, token-level annotation tool for non-standard language data☆10Updated 4 years ago
- ☆17Updated 9 years ago
- Named entity annotation tool☆27Updated last year
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 2 years ago
- Ontolex modules☆30Updated last year
- All ontologies used in NIF 2.0 (NIF-Core + vocabulary modules + helper modules)☆36Updated 7 years ago
- Example SPARQL queries, mostly for working with ZBW data sets☆15Updated 2 months ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆68Updated last year
- NLP pipeline software using common workflow language☆34Updated 5 years ago
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Updated 2 months ago
- Python API for KB data-services☆18Updated 4 years ago
- Pikes is a Knowledge Extraction Suite☆23Updated 11 months ago
- Linked Data explorer and SPARQL endpoint☆23Updated 2 years ago
- Project to digitize avant-garde periodicals☆12Updated 2 years ago
- Specification of NAF, the NLP annotation format☆21Updated 3 years ago