MarsPanther / Amharic-English-Machine-Translation-CorpusLinks
Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.
☆43Updated 7 years ago
Alternatives and similar repositories for Amharic-English-Machine-Translation-Corpus
Users that are interested in Amharic-English-Machine-Translation-Corpus are comparing it to the libraries listed below
Sorting:
- Morphological processing for languages of the Horn of Africa☆49Updated 3 weeks ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆40Updated 3 years ago
- A curated list of research papers and resources on code-switching☆326Updated 10 months ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆278Updated 4 months ago
- Datasets and tools for basic natural language processing.☆386Updated 4 years ago
- ☆24Updated 4 years ago
- A module for normalising text.☆173Updated 3 years ago
- ☆45Updated 3 years ago
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)☆755Updated this week
- Massively multilingual pronunciation mining☆352Updated last month
- ☆45Updated 7 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆377Updated last year
- Universal Romanizer that can convert any unicode script to roman (latin) script☆226Updated last year
- Punctuation restoration and spell correction experiments.☆251Updated 4 years ago
- 📝An easy-to-use package to restore punctuation of the text.☆118Updated 2 years ago
- 📃Language Model based sentences scoring library☆309Updated 3 years ago
- State of the Art Language models and Classifier for Kannada, which is spoken predominantly by Kannada people in India, mainly in the stat…☆34Updated 5 years ago
- This is a Pytorch (+ Huggingface transformers) implementation of a "simple" text classifier defined using BERT-based models. In this lab …☆19Updated 4 years ago
- Python Finite-State Toolkit☆58Updated this week
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆109Updated 4 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆51Updated 2 years ago
- ☆54Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆111Updated last year
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆221Updated last year
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆16Updated 5 years ago
- Python library for converting numbers to words for all Indian Languages.☆36Updated 4 months ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆35Updated 2 months ago
- Automatic Mapping of Disfluency Annotations for corrected version of Switchboard☆17Updated 6 years ago
- PHOIBLE data and development.☆134Updated last year
- Different semantic models for Amharic☆21Updated last year