LanguageMachines / ticcltools
Tools for TICCL
☆14Updated 2 months ago
Alternatives and similar repositories for ticcltools:
Users that are interested in ticcltools are comparing it to the libraries listed below
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Updated 3 years ago
- Named Entity Recognition tool for Europeana Newspapers☆14Updated 6 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 5 years ago
- Text-Induced Corpus Clean-up☆20Updated last year
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- Process, enhance and evaluate multiple OCR output.☆22Updated 3 months ago
- ☆11Updated last month
- A web-based, token-level annotation tool for non-standard language data☆10Updated 4 years ago
- All ontologies used in NIF 2.0 (NIF-Core + vocabulary modules + helper modules)☆36Updated 7 years ago
- Specification of NAF, the NLP annotation format☆21Updated 4 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 8 years ago
- 🕸 YALC: Yet Another LOD Cloud (registry of Linked Open Datasets).☆15Updated last year
- DBpedia, which frequently crawls and analyses over 120 Wikipedia language editions has near complete information about (1) which facts ar…☆10Updated 2 years ago
- Linked Open Vocabularies (LOV) - scripts☆9Updated 8 years ago
- Collections of english historical texts and data relating to them☆18Updated 3 years ago
- ☆16Updated 9 years ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Updated 4 months ago
- ☆14Updated 3 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Updated last year
- Linked SDMX☆17Updated 10 years ago
- Web Tables Automatic Property Mapping☆7Updated 5 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆61Updated 9 months ago
- Named entity annotation tool☆27Updated last year
- Python API for KB data-services☆19Updated 5 years ago
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13Updated 4 months ago
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Updated 6 months ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago