sharavsambuu / english-mongolian-nmt-dataset-augmentation
Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google Translate service
☆18Updated 2 months ago
Alternatives and similar repositories for english-mongolian-nmt-dataset-augmentation
Users interested in english-mongolian-nmt-dataset-augmentation compare it to the repositories listed below.
- Cyrillic Mongolian text classification with TensorFlow 2, plus some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe…☆33Updated 2 years ago
- Useful resources for Mongolian NLP☆190Updated 9 months ago
- Arabic edition of BERT pretrained language models☆132Updated 4 years ago
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.☆72Updated last year
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆47Updated 4 years ago
- A paraphrase generator built using the T5 model which produces paraphrased English sentences.☆316Updated last month
- Automatically Score essays using Deep Learning☆151Updated 5 years ago
- Lecture and seminar materials for Deep Learning summer school in Ulaanbaatar, 2019☆12Updated 3 years ago
- Arabic Dialect Identification on AOC data.☆24Updated 6 years ago
- Machine Translation for Africa☆296Updated 3 years ago
- Hotels Arabic-Reviews Dataset☆32Updated 6 years ago
- Arabic support for textblob☆85Updated 3 years ago
- Arabic to English machine translation with Transformers and PyTorch☆23Updated 9 months ago
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆24Updated 4 years ago
- Myanmar Word Segmentation Tool☆31Updated 6 years ago
- Open source speech to text models for Indic Languages☆306Updated 3 years ago
- Arabic named entity recognition using the AnerCorp corpus (location, organisation, person, miscellaneous word)☆37Updated 8 years ago
- Arabic Open Domain Question Answering System using Neural Reading Comprehension☆164Updated 2 years ago
- This repository contains examples of custom components for educational purposes.☆194Updated last year
- Jupyter notebooks that use the Fastai library☆92Updated 4 years ago
- Монгол үгийн алдаа шалгах толь, Mongolian spellchecking dictionary☆86Updated this week
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided☆185Updated 2 years ago
- Question Generation using Google T5 and Text2Text☆153Updated 4 years ago
- Crowd sourced training data for Rasa NLU models☆201Updated last year
- State of the Art Language models and Classifier for Bengali, which is primarily spoken by the Bengalis in South Asia.☆32Updated 5 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆111Updated last year
- myPOS (Myanmar Part-of-Speech) Corpus for Myanmar NLP Research and Developments☆77Updated 9 months ago
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing☆764Updated 2 months ago