maobedkova / AmharicCorpus
The set of files used for the development of the Amharic Corpus.
☆11 Updated 7 years ago
Alternatives and similar repositories for AmharicCorpus:
Users who are interested in AmharicCorpus are comparing it to the repositories listed below:
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya ☆11 Updated 7 years ago
- A JavaScript-based converter for transliterating Amharic text into Latin characters ☆19 Updated 2 years ago
- Amharic/Tigrinya/Oromo Dictionaries ☆37 Updated last year
- HORNMORPHO is a Python program that analyzes Amharic, Oromo, and Tigrinya words into their constituent morphemes (meaningful parts) and g… ☆19 Updated 7 years ago
- CoNLL 2018 Shared Task Team UDPipe-Future ☆39 Updated 4 years ago
- A tool for text normalisation via character-level machine translation ☆13 Updated 4 years ago
- simple bs4 based web crawl for a corpus in need of statistical machine translation ☆13 Updated 3 years ago
- Featurize words into orthographic and phonological vectors. ☆40 Updated last year
- A simple configurable tool for manipulating dependency trees. ☆13 Updated 3 weeks ago
- ☆23 Updated 7 years ago
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees. ☆12 Updated 4 months ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings. ☆16 Updated 7 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016… ☆66 Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts ☆79 Updated 11 months ago
- Python framework for processing Universal Dependencies data ☆56 Updated 3 weeks ago
- Morphological processing for languages of the Horn of Africa ☆43 Updated this week
- PredPatt: Predicate-Argument Extraction from Universal Dependencies ☆112 Updated 3 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, … ☆34 Updated 5 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory ☆41 Updated 2 months ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses) ☆64 Updated last year
- Software for multi-level annotation of linguistic corpora ☆17 Updated 5 years ago
- eXternally configurable REference and Non Named Entity Recognizer ☆17 Updated 7 months ago
- CONLL-U to Pandas DataFrame ☆31 Updated 7 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 Updated last year
- This repository contains the Framester resource, the main outcome of the framester project. ☆34 Updated 4 years ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases ☆43 Updated 4 years ago
- KenLM extension for spaCy 2.0. ☆16 Updated 7 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them ☆37 Updated 2 years ago
- several algorithms for converting dependency structures into constituency structures. ☆10 Updated 2 years ago
- Distillation of Ensemble Dependency Parsers into a Single Graph-Based Parser ☆11 Updated 8 years ago