sharavsambuu / english-mongolian-nmt-dataset-augmentation
Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google Translate service
☆18Updated 2 weeks ago
Alternatives and similar repositories for english-mongolian-nmt-dataset-augmentation:
Users that are interested in english-mongolian-nmt-dataset-augmentation are comparing it to the libraries listed below
- Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe…☆32Updated 2 years ago
- Pre-trained Mongolian BERT models☆45Updated 4 years ago
- Useful resources for Mongolian NLP☆176Updated 2 months ago
- The Mongolian Wordnet (MonWN)☆17Updated 3 years ago
- Монгол үгийн алдаа шалгах толь, Mongolian spellchecking dictionary☆81Updated last week
- Pytorch-Named-Entity-Recognition-with-BERT☆15Updated 4 years ago
- ALBERT trained on Mongolian text corpus☆18Updated 4 years ago
- Lecture and seminar materials for Deep Learning summer school in Ulaanbaatar, 2019☆12Updated 3 years ago
- Lecture and seminar materials for Deep Learning summer school in Ulaanbaatar, 2021☆10Updated 3 years ago
- Myanmar Word Segmentation Tool☆29Updated 6 years ago
- Mongolian automated license plate recognition.☆13Updated 4 years ago
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆193Updated 4 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆40Updated last year
- ☆18Updated last year
- Abstractive Text Summarization using PyTorch☆14Updated 4 years ago
- Datasets and tools for basic natural language processing.☆375Updated 3 years ago
- Jupyter notebooks that use the Fastai library☆92Updated 3 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆24Updated last year
- A Python based API to access Indian language WordNets.☆39Updated 2 years ago
- Bolor tolidogch нь болор толь руу байнга орж үг хайх үйлдлийг хялбарчилсан chrome extension юм.☆22Updated last year
- BERT Question and Answer system meant and works well for only limited number of words summary like 1 to 2 paragraphs only. It can’t be ab…☆113Updated 3 years ago
- Generate large textual corpora for almost any language by crawling the web☆12Updated last year
- myPOS (Myanmar Part-of-Speech) Corpus for Myanmar NLP Research and Developments☆71Updated 2 months ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆206Updated 6 months ago
- Python library for Myanmar language☆34Updated last year
- TUFS Asian Language Parallel Corpus☆50Updated last year
- ☆32Updated 11 years ago
- The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts.☆42Updated 5 years ago
- Automatically Score essays using Deep Learning☆147Updated 4 years ago
- OpusFilter - Parallel corpus processing toolkit☆104Updated 3 weeks ago