sharavsambuu / mongolian-text-classification
Cyrillic Mongolian text classification with TensorFlow 2. Also includes fine-tuning of TugsTugi's Mongolian BERT model and other NLP experiments.
☆32 Updated 2 years ago
Alternatives and similar repositories for mongolian-text-classification
Users that are interested in mongolian-text-classification are comparing it to the libraries listed below
- Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google… ☆18 Updated 3 months ago
- Useful resources for Mongolian NLP ☆184 Updated 5 months ago
- Pre-trained Mongolian BERT models ☆46 Updated 4 years ago
- Mongolian speech recognition with PyTorch ☆134 Updated 4 years ago
- Pytorch-Named-Entity-Recognition-with-BERT ☆15 Updated 4 years ago
- The Mongolian Wordnet (MonWN) ☆17 Updated 3 years ago
- (Work in progress) React documentation website in Mongolian ☆41 Updated this week
- Text to Speech with PyTorch (English and Mongolian) ☆185 Updated 7 months ago
- Mongolian spellchecking dictionary (Монгол үгийн алдаа шалгах толь) ☆83 Updated last month
- cLang-8 is a dataset for grammatical error correction. ☆104 Updated 2 years ago
- Lecture and seminar materials for Deep Learning summer school in Ulaanbaatar, 2021 ☆10 Updated 3 years ago
- Bitextor generates translation memories from multilingual websites ☆293 Updated 6 months ago
- SOTA punctuation restoration (e.g. for automatic speech recognition) deep learning model based on a pre-trained BERT model ☆180 Updated 6 years ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney ☆162 Updated 10 months ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆26 Updated 2 years ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data. ☆48 Updated 6 years ago
- ☆119 Updated 4 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos ☆154 Updated 5 years ago
- Machine Translation Web Interface for OpenNMT-py ☆25 Updated 3 years ago
- Efficient Low-Memory Aligner ☆143 Updated 4 months ago
- Datasets and tools for basic natural language processing. ☆381 Updated 3 years ago
- A tool for converting TMX files into bilingual corpora ☆18 Updated 5 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection ☆73 Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆157 Updated 11 months ago
- NTREX -- News Test References for MT Evaluation ☆83 Updated 11 months ago
- Python library for converting numbers to words for all Indian Languages. ☆35 Updated 4 months ago
- CMU Wilderness Multilingual Speech Dataset ☆280 Updated 6 years ago
- XenC: open-source data selection tool for NLP ☆64 Updated 9 years ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment. ☆163 Updated last year
- OpusFilter - Parallel corpus processing toolkit ☆104 Updated last month