sharavsambuu / english-mongolian-nmt-dataset-augmentation
Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google Translate service
☆18 Updated 6 months ago
Alternatives and similar repositories for english-mongolian-nmt-dataset-augmentation
Users who are interested in english-mongolian-nmt-dataset-augmentation are comparing it to the repositories listed below
- Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe… ☆33 Updated 3 years ago
- Pre-trained Mongolian BERT models ☆49 Updated 4 years ago
- Useful resources for Mongolian NLP ☆194 Updated last year
- The Mongolian Wordnet (MonWN) ☆17 Updated 4 years ago
- Text2Text Language Modeling Toolkit ☆304 Updated 11 months ago
- Arabic edition of BERT pretrained language models ☆132 Updated 5 years ago
- We gather Malaysian dataset! https://malaysian-dataset.readthedocs.io/ ☆328 Updated this week
- ☆25 Updated last year
- ☆23 Updated last year
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing ☆784 Updated 5 months ago
- Crowd sourced training data for Rasa NLU models ☆201 Updated 2 years ago
- BERT Question and Answer system meant and works well for only limited number of words summary like 1 to 2 paragraphs only. It can’t be ab… ☆114 Updated 4 years ago
- Python library for Myanmar language ☆38 Updated last year
- Machine Translation for Africa ☆303 Updated 3 years ago
- A morphosyntactic analyzer for the Arabic language. ☆24 Updated 5 years ago
- Facebook Low Resource (FLoRes) MT Benchmark ☆757 Updated 2 years ago
- This repository contains examples of custom components for educational purposes. ☆196 Updated 2 years ago
- Automatically Score essays using Deep Learning ☆150 Updated 5 years ago
- RASA based voice bot after 1 months jump in to AI ;) ☆29 Updated 6 years ago
- Word segmentation using Conditional Random Fields (CRF) for Khmer document ☆32 Updated 3 months ago
- A seq2seq model that can correct spelling mistakes. ☆216 Updated 8 years ago
- Yorùbá language training text for NLP, ASR and TTS tasks ☆81 Updated 2 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆112 Updated last year
- Arabic Open Domain Question Answering System using Neural Reading Comprehension ☆164 Updated 2 years ago
- This is a conversational bot that can be used in the telecom sector for automating the voice bot process with the help of this Rasa Chatb… ☆29 Updated last year
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks. ☆72 Updated last year
- Myanmar Word Segmentation Tool ☆32 Updated 7 years ago
- An NLP system for generating reading comprehension questions ☆298 Updated last year
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment. ☆174 Updated 2 weeks ago
- Data Augmentation by Backtranslation (DAB) ヽ( •_-)ᕗ ☆68 Updated 3 years ago