acoli-repo / acoli-dicts
3000+ machine-readable open source dictionaries distributed by the Applied Computational Linguistics lab at the University of Augsburg, Germany, and by the research group Linked Open Dictionaries (LiODi, funded 2015-2020 by BMBF at Goethe University Frankfurt, Germany). All data provided in OntoLex-Lemon and TIAD-TSV.
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for acoli-dicts
- Morphological analyzer and lemmatizer for Latin.☆25Updated last week
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆20Updated this week
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated 2 weeks ago
- Annotation tool for coreference☆32Updated last year
- Multi Tier Annotation Search☆26Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- LexInfo - Data Category Ontology for OntoLex-Lemon☆22Updated last year
- A character-wise tokenizer for morphologically rich languages☆27Updated 5 months ago
- Aksharamukha Python Library☆43Updated last month
- Data for the HIPE 2022 shared task.☆16Updated 11 months ago
- TEI Reader Python Library☆16Updated 11 months ago
- A character-level BERT for Ancient Greek☆10Updated last year
- The Global WordNet Association Collaborative Inter-Lingual Index☆40Updated 2 weeks ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 3 years ago
- OCR post correction for old German corpus☆19Updated 2 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆16Updated this week
- A cloud-based, open-source system for writing and publishing dictionaries.☆86Updated 10 months ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆22Updated last year
- UIMA CAS processing library written in Python☆85Updated 6 months ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆70Updated this week
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆70Updated last week
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆15Updated 5 months ago
- Latin BERT☆57Updated 4 months ago
- The Open Multilingual Wordnet☆58Updated 6 months ago
- This packages up data for the Open Multilingual Wordnet☆43Updated 3 weeks ago
- Named entity annotation tool☆27Updated last year
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 3 months ago
- Public repository for Coptic SCRIPTORIUM Corpora Releases☆32Updated this week
- These are lists for a variety of languages containing words that are distinctive to each language.☆34Updated 2 years ago
- Runnable morphological analysis tools from the UniMorph project☆14Updated 6 years ago