sharavsambuu / english-mongolian-nmt-dataset-augmentationLinks
Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google Translate service
☆18Updated 5 months ago
Alternatives and similar repositories for english-mongolian-nmt-dataset-augmentation
Users that are interested in english-mongolian-nmt-dataset-augmentation are comparing it to the libraries listed below
Sorting:
- Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe…☆33Updated 2 years ago
- Pre-trained Mongolian BERT models☆47Updated 4 years ago
- We use LSTM, BiLSTM, BERT and SVM with TF-IDF, Word2vec and Bag-of-words to classify this documents to positive (labeled as 1), neutral (…☆33Updated 2 years ago
- RASA based voice bot after 1 months jump in to AI ;)☆29Updated 6 years ago
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.☆72Updated last year
- BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021)☆87Updated last year
- Arabic Dialect Identification on AOC data.☆24Updated 6 years ago
- Vietnamese language model for spacy.io☆114Updated 2 years ago
- ☆33Updated 12 years ago
- A Large-scale Vietnamese News Text Classification Corpus☆104Updated 6 years ago
- Compilation of Manually Tagged Roman Urdu Dataset (Urdu written in Latin/Roman Script), along with other helpful Roman Urdu NLP resources☆34Updated 5 years ago
- Arabic edition of BERT pretrained language models☆132Updated 4 years ago
- ☆16Updated 5 years ago
- My Notes on Tensorflow Dev Summit 2020☆13Updated 5 years ago
- Neural Machine Translation system for English to Vietnamese (IWSLT'15 English-Vietnamese data)☆62Updated 6 years ago
- myPOS (Myanmar Part-of-Speech) Corpus for Myanmar NLP Research and Developments☆78Updated 2 months ago
- Automatically Score essays using Deep Learning☆151Updated 5 years ago
- ☆115Updated last month
- A Feature-based Vietnamese Named-Entity Recognition Model☆31Updated 6 years ago
- Submission for AIviVN Vietnamese diacritics restoration contest https://www.aivivn.com/contests/3☆40Updated last year
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆201Updated 5 years ago
- Text Summarization for Research Papers☆78Updated 3 years ago
- The English-Vietnamese Bilingual Corpus (EVBCorpus) is a collection of English and Vietnamese parallel translations and bitexts.☆47Updated 6 years ago
- Монгол үгийн алдаа шалгах толь, Mongolian spellchecking dictionary☆87Updated 3 weeks ago
- An ensemble system with a search engine for relevant document retrieval and a deep learning model (BERT) for machine comprehension in Vie…☆14Updated 6 years ago
- A Python wrapper for VnCoreNLP using a bidirectional communication channel.☆57Updated 7 years ago
- PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)☆150Updated 11 months ago
- ☆15Updated 4 years ago
- Khmer unicode text data for unsupervised learning language model☆25Updated 4 years ago
- Keras implementation of character-level sequence-to-sequence learning for spelling correction☆73Updated 6 years ago