MarsPanther / Amharic-English-Machine-Translation-CorpusLinks
Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.
☆43Updated 7 years ago
Alternatives and similar repositories for Amharic-English-Machine-Translation-Corpus
Users that are interested in Amharic-English-Machine-Translation-Corpus are comparing it to the libraries listed below
Sorting:
- Morphological processing for languages of the Horn of Africa☆46Updated 2 weeks ago
- Resources and tools for Indian language Natural Language Processing☆602Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆40Updated 2 years ago
- Python package for indic script transliteration☆192Updated this week
- The project aims on adding a state-of-the-art transliteration module for cross transliterations among all Indian languages including Engl…☆271Updated 2 years ago
- Crawler for linguistic corpora☆208Updated 2 weeks ago
- A Python based API to access Indian language WordNets.☆38Updated 3 years ago
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)☆747Updated last month
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆109Updated last year
- Fast and accurate spell correction library☆81Updated 3 years ago
- Datasets and tools for basic natural language processing.☆386Updated 3 years ago
- Facebook Low Resource (FLoRes) MT Benchmark☆753Updated last year
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆41Updated 2 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆47Updated 2 years ago
- Punctuation restoration and spell correction experiments.☆250Updated 4 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆158Updated last year
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Updated last year
- A sentence segmenter that actually works!☆305Updated 5 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆451Updated last year
- ✔️Contextual word checker for better suggestions (not actively maintained)☆417Updated 7 months ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated 2 months ago
- Efficient Low-Memory Aligner☆146Updated 7 months ago
- A curated list of research papers and resources on code-switching☆323Updated 8 months ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆375Updated last year
- Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi☆16Updated last year
- Resources to go with the Indic NLP Library☆75Updated 3 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆197Updated 4 years ago
- ☆20Updated 3 years ago
- Benchmark Arabic text diacritization dataset☆75Updated 6 years ago
- A tool for converting TMX files into bilingual corpora☆18Updated 5 years ago