sharavsambuu / english-mongolian-nmt-dataset-augmentationLinks
Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google Translate service
☆18Updated 2 months ago
Alternatives and similar repositories for english-mongolian-nmt-dataset-augmentation
Users that are interested in english-mongolian-nmt-dataset-augmentation are comparing it to the libraries listed below
Sorting:
- Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe…☆33Updated 2 years ago
- Pre-trained Mongolian BERT models☆47Updated 4 years ago
- Useful resources for Mongolian NLP☆190Updated 8 months ago
- Mongolian speech recognition with PyTorch☆135Updated 4 years ago
- Монгол үгийн алдаа шалгах толь, Mongolian spellchecking dictionary☆85Updated last month
- The Mongolian Wordnet (MonWN)☆18Updated 3 years ago
- A simple Flask website for all NLP tasks which includes Text Preprocessing, Keyword Extraction, Text Summarization etc. Created Date: 30 …☆70Updated 3 years ago
- Text to Speech with PyTorch (English and Mongolian)☆185Updated 11 months ago
- Newspaper Segmentation into images and text☆12Updated 6 years ago
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.☆72Updated last year
- Pytorch-Named-Entity-Recognition-with-BERT☆15Updated 4 years ago
- Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.c…☆287Updated 2 years ago
- Open source speech to text models for Indic Languages☆307Updated 2 years ago
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/…☆60Updated 4 years ago
- Jupyter notebooks that use the Fastai library☆92Updated 4 years ago
- Train Spacy ner with custom dataset☆182Updated 2 years ago
- Fast and accurate spell correction library☆81Updated 3 years ago
- Text Summarization for Research Papers☆77Updated 2 years ago
- ☆43Updated 3 years ago
- A demo for a financial services bot☆320Updated 3 months ago
- This repository contains an attempt to incorporate Rasa Chatbot with state-of-the-art ASR (Automatic Speech Recognition) and TTS (Text-to…☆22Updated 5 years ago
- Automatic Post-Editing for Vietnamese☆13Updated 3 years ago
- Python package for indic script transliteration☆192Updated last week
- Named-entity recognition (NER) (also known as entity identification, entity chunking and entity extraction) is a subtask of information e…☆31Updated 5 years ago
- ☆97Updated 5 years ago
- Myanmar Word Segmentation Tool☆31Updated 6 years ago
- State of the Art Language models and Classifier for Tamil language (spoken in India, and few other South Asian countries)☆53Updated 5 years ago
- Compilation of Manually Tagged Roman Urdu Dataset (Urdu written in Latin/Roman Script), along with other helpful Roman Urdu NLP resources☆33Updated 4 years ago
- Question Generation using Google T5 and Text2Text☆153Updated 4 years ago
- Crowd sourced training data for Rasa NLU models☆201Updated last year