maobedkova / AmharicCorpus
The set of files used for the development of the Amharic Corpus.
☆11Updated 7 years ago
Related projects: ⓘ
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya☆11Updated 7 years ago
- A JavaScript-based converter for transliterating Amharic text into Latin characters☆20Updated 2 years ago
- HORNMORPHO is a Python program that analyzes Amharic, Oromo, and Tigrinya words into their constituent morphemes (meaningful parts) and g…☆18Updated 6 years ago
- Amharic/Tigrinya/Oromo Dictionaries☆36Updated last year
- Morphological processing for languages of the Horn of Africa☆39Updated this week
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies☆14Updated 4 years ago
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.☆11Updated 2 weeks ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- A library for generating Ethiopic fake data such as names, addresses, and phone numbers☆16Updated 6 years ago
- ☆61Updated 4 months ago
- CONLL-U to Pandas DataFrame☆30Updated 6 years ago
- ☆23Updated 7 years ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆63Updated last year
- Utility scripts in Python☆37Updated last month
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Updated 7 years ago
- ☆10Updated 7 years ago
- ☆14Updated 6 years ago
- universal syllabification algorithms☆43Updated last year
- Featurize words into orthographic and phonological vectors.☆39Updated last year
- ☆43Updated 9 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆50Updated last year
- Command-line corpus tools☆9Updated 7 years ago
- Runnable morphological analysis tools from the UniMorph project☆14Updated 5 years ago
- The Arborator software is aimed at collaboratively annotating dependency corpora.☆24Updated 4 years ago
- How (but not why) to do Twitter sociolinguistic analysis in the Unix Shell☆10Updated 8 years ago
- A natural language processing tool for automatically detecting quotations in text.☆15Updated 2 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- Software for phonetic transcription of English and Finnish, and IPA tools☆15Updated 8 years ago
- Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.☆37Updated 6 years ago