sharavsambuu / english-mongolian-nmt-dataset-augmentationLinks
Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google Translate service
☆18Updated 4 months ago
Alternatives and similar repositories for english-mongolian-nmt-dataset-augmentation
Users that are interested in english-mongolian-nmt-dataset-augmentation are comparing it to the libraries listed below
Sorting:
- Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe…☆32Updated 2 years ago
- Pre-trained Mongolian BERT models☆46Updated 4 years ago
- Useful resources for Mongolian NLP☆184Updated 5 months ago
- Mongolian speech recognition with PyTorch☆134Updated 4 years ago
- The Mongolian Wordnet (MonWN)☆17Updated 3 years ago
- Монгол үгийн алдаа шалгах толь, Mongolian spellchecking dictionary☆85Updated 2 months ago
- Pytorch-Named-Entity-Recognition-with-BERT☆15Updated 4 years ago
- Benchmark Arabic text diacritization dataset☆75Updated 5 years ago
- Arabic edition of BERT pretrained language models☆129Updated 4 years ago
- A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be…☆26Updated 11 years ago
- Python transliteration library (mostly from non-latin scripts, such as Arabic, Japanese, etc.)☆20Updated 6 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆14Updated 3 years ago
- ☆30Updated 5 years ago
- Banking chatbot based on Rasa open source machine learning tools for developers to create contextual AI assistants and chatbots that go b…☆19Updated 5 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆26Updated 2 years ago
- ☆43Updated 9 years ago
- ☆42Updated 2 years ago
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆24Updated 4 years ago
- Text to Speech with PyTorch (English and Mongolian)☆184Updated 8 months ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆80Updated 2 weeks ago
- A morphosyntactic analyzer for the Arabic language.☆23Updated 5 years ago
- Arabic Dialect Identification on AOC data.☆24Updated 6 years ago
- ☆40Updated last month
- Lecture and seminar materials for Deep Learning summer school in Ulaanbaatar, 2019☆12Updated 3 years ago
- Arabic named entity recognition using AnerCorp corpus (location , organisation, person, Miscellaneous Word)☆37Updated 7 years ago
- This is a diacritization model for Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on…☆42Updated last year
- Arabic support for textblob☆85Updated 3 years ago
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆47Updated 4 years ago
- Arabic Open Domain Question Answering System using Neural Reading Comprehension☆165Updated last year
- Pre-process arabic text (remove diacritics, punctuations and repeating characters)☆106Updated 8 years ago