acoli-repo / acoli-dicts
3000+ machine-readable open source dictionaries distributed by the Applied Computational Linguistics lab at the University of Augsburg, Germany, and by the research group Linked Open Dictionaries (LiODi, funded 2015-2020 by BMBF at Goethe University Frankfurt, Germany). All data provided in OntoLex-Lemon and TIAD-TSV.
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for acoli-dicts
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆20Updated last year
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated this week
- Named entity annotation tool☆27Updated last year
- Morphological analyzer and lemmatizer for Latin.☆25Updated last week
- Annotation tool for coreference☆32Updated last year
- Advanced graph rewriting and LLOD publication for CoNLL and other TSV formats☆25Updated 5 months ago
- LexInfo - Data Category Ontology for OntoLex-Lemon☆22Updated last year
- Aksharamukha Python Library☆43Updated 3 weeks ago
- Data for the HIPE 2022 shared task.☆15Updated 11 months ago
- This packages up data for the Open Multilingual Wordnet☆43Updated last week
- Multi Tier Annotation Search☆26Updated 3 years ago
- Latin BERT☆56Updated 4 months ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆17Updated 5 months ago
- ☆20Updated last month
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated 6 months ago
- CLDF: Cross-Linguistic Data Formats - the specification☆55Updated 6 months ago
- Ancient Greek language models for spaCy☆24Updated 3 months ago
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆69Updated last week
- BERT and ELECTRA models trained on Europeana Newspapers☆36Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- ☆32Updated last year
- Runnable morphological analysis tools from the UniMorph project☆14Updated 5 years ago
- A character-wise tokenizer for morphologically rich languages☆27Updated 4 months ago
- ☆63Updated 5 months ago
- Public repository for Coptic SCRIPTORIUM Corpora Releases☆32Updated last week
- The Open Multilingual Wordnet☆58Updated 6 months ago
- Named Entity Recognition☆16Updated this week
- A software to detect text reuse with BLAST.☆14Updated 5 years ago
- OCR post correction for old German corpus☆19Updated 2 years ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆70Updated last week