MarsPanther / Amharic-English-Machine-Translation-CorpusLinks
Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.
☆43Updated 7 years ago
Alternatives and similar repositories for Amharic-English-Machine-Translation-Corpus
Users that are interested in Amharic-English-Machine-Translation-Corpus are comparing it to the libraries listed below
Sorting:
- Morphological processing for languages of the Horn of Africa☆52Updated 3 months ago
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)☆778Updated this week
- Facebook Low Resource (FLoRes) MT Benchmark☆757Updated 2 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆385Updated 2 years ago
- A curated list of research papers and resources on code-switching☆329Updated last year
- Lexical Data of Ge'ez Languages☆55Updated 3 years ago
- Massively multilingual pronunciation mining☆358Updated 4 months ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆173Updated last week
- Different semantic models for Amharic☆21Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆40Updated 3 years ago
- Machine-Translation-based sentence alignment tool for parallel text☆313Updated 4 years ago
- The project aims on adding a state-of-the-art transliteration module for cross transliterations among all Indian languages including Engl…☆273Updated 3 years ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆288Updated 2 months ago
- A neural word aligner based on multilingual BERT☆363Updated 3 years ago
- Best Practices in Translation Memory Management☆47Updated 7 years ago
- ☆15Updated 4 years ago
- Datasets and tools for basic natural language processing.☆386Updated 4 years ago
- 📃Language Model based sentences scoring library☆308Updated 3 years ago
- Improved Sentence Alignment in Linear Time and Space☆186Updated 2 years ago
- Code for the ACL 2020 Paper on Schwa Deletion in Hindi and Punjabi☆16Updated 2 years ago
- Bitextor generates translation memories from multilingual websites☆299Updated last year
- Python port of Moses tokenizer, truecaser and normalizer☆495Updated last year
- A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.☆510Updated 2 months ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆159Updated last year
- ☆45Updated 3 years ago
- Machine Translation for Africa☆303Updated 3 years ago
- Sentence aligner☆122Updated 4 years ago
- Resources and tools for Indian language Natural Language Processing☆619Updated last year
- Simple, fast unsupervised word aligner☆765Updated 3 years ago