sharavsambuu / english-mongolian-nmt-dataset-augmentation
Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google Translate service
☆18 Updated 4 months ago
Alternatives and similar repositories for english-mongolian-nmt-dataset-augmentation
Users that are interested in english-mongolian-nmt-dataset-augmentation are comparing it to the libraries listed below
- Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe… ☆33 Updated 2 years ago
- Useful resources for Mongolian NLP ☆191 Updated 10 months ago
- Pre-trained Mongolian BERT models ☆47 Updated 4 years ago
- Mongolian speech recognition with PyTorch ☆136 Updated 4 years ago
- Automatically Score essays using Deep Learning ☆151 Updated 5 years ago
- Crowd sourced training data for Rasa NLU models ☆203 Updated last year
- Open source speech to text models for Indic Languages ☆309 Updated 3 years ago
- Lecture and seminar materials for Deep Learning summer school in Ulaanbaatar, 2021 ☆10 Updated 4 years ago
- ☆114 Updated 3 weeks ago
- Named Entity Recognition in Nepali Language ☆11 Updated 2 years ago
- This repository contains examples of custom components for educational purposes. ☆196 Updated last year
- Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagg… ☆936 Updated last year
- This repository contains a few simple projects with forms. ☆49 Updated 3 years ago
- Question Generation using Google T5 and Text2Text ☆153 Updated 4 years ago
- ☆27 Updated 6 years ago
- The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts. ☆46 Updated 6 years ago
- Use Rasa to build a FAQ bot ☆75 Updated 3 years ago
- A demo for a financial services bot ☆322 Updated 2 months ago
- BERT Question and Answer system meant and works well for only limited number of words summary like 1 to 2 paragraphs only. It can’t be ab… ☆114 Updated 4 years ago
- Pytorch-Named-Entity-Recognition-with-BERT ☆15 Updated 5 years ago
- Visualizations and helpers to improve and debug machine learning models for Rasa Open Source ☆311 Updated 3 years ago
- A simple approach to use GPT2-medium (345M) for generating high quality text summaries with minimal training. ☆156 Updated 2 years ago
- Paraphrase any question with T5 (Text-To-Text Transfer Transformer) - Pretrained model and training script provided ☆185 Updated 2 years ago
- Automated Essay Scoring on The Hewlett Foundation dataset on Kaggle ☆32 Updated 7 years ago
- Datasets and tools for basic natural language processing. ☆386 Updated 4 years ago
- ☆197 Updated last year
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase… ☆201 Updated 5 years ago
- The kinyarwanda model for deepspeech ☆16 Updated 4 years ago
- ☆125 Updated 4 years ago
- RASA based voice bot after 1 months jump in to AI ;) ☆30 Updated 6 years ago