sharavsambuu / english-mongolian-nmt-dataset-augmentationLinks
Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google Translate service
☆18Updated last month
Alternatives and similar repositories for english-mongolian-nmt-dataset-augmentation
Users that are interested in english-mongolian-nmt-dataset-augmentation are comparing it to the libraries listed below
Sorting:
- Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe…☆33Updated 2 years ago
- Pre-trained Mongolian BERT models☆47Updated 4 years ago
- Useful resources for Mongolian NLP☆189Updated 7 months ago
- Mongolian speech recognition with PyTorch☆135Updated 4 years ago
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.☆72Updated last year
- The Mongolian Wordnet (MonWN)☆18Updated 3 years ago
- Machine Translation for Africa☆292Updated 3 years ago
- Arabic edition of BERT pretrained language models☆130Updated 4 years ago
- Open source speech to text models for Indic Languages☆306Updated 2 years ago
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆47Updated 4 years ago
- Datasets and tools for basic natural language processing.☆386Updated 3 years ago
- A morphosyntactic analyzer for the Arabic language.☆23Updated 5 years ago
- Dvoice est un outil de reconnaissance vocale pour les dialectes et les langues peu représentées.☆32Updated 3 years ago
- ☆42Updated 2 years ago
- Automatically Score essays using Deep Learning☆150Updated 5 years ago
- We gather Malaysian dataset! https://malaysian-dataset.readthedocs.io/☆322Updated this week
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆106Updated last year
- ☆30Updated 5 years ago
- myPOS (Myanmar Part-of-Speech) Corpus for Myanmar NLP Research and Developments☆74Updated 8 months ago
- State of the Art Language models and Classifier for Nepali, which is official language of Nepal and one of the official status gained lan…☆29Updated 5 years ago
- A simple Flask website for all NLP tasks which includes Text Preprocessing, Keyword Extraction, Text Summarization etc. Created Date: 30 …☆70Updated 3 years ago
- ☆18Updated 5 years ago
- ☆193Updated last year
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing☆760Updated 2 weeks ago
- End to end Arabic TTS system based on tacotron☆120Updated last year
- Compilation of Manually Tagged Roman Urdu Dataset (Urdu written in Latin/Roman Script), along with other helpful Roman Urdu NLP resources☆33Updated 4 years ago
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic☆110Updated 3 years ago
- ☆43Updated 3 years ago
- Pytorch-Named-Entity-Recognition-with-BERT☆15Updated 4 years ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆129Updated last year