sharavsambuu / mongolian-text-classification
Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP experiments are included.
☆32Updated last year
Related projects ⓘ
Alternatives and complementary repositories for mongolian-text-classification
- Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google…☆17Updated 3 months ago
- Useful resources for Mongolian NLP☆172Updated last year
- Pre-trained Mongolian BERT models☆43Updated 3 years ago
- Mongolian speech recognition with PyTorch☆129Updated 3 years ago
- The Mongolian Wordnet (MonWN)☆17Updated 2 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆24Updated last year
- Pytorch-Named-Entity-Recognition-with-BERT☆15Updated 4 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆36Updated last year
- Deep Learning neural network for correcting spelling☆54Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆150Updated 4 months ago
- Arabic edition of BERT pretrained language models☆127Updated 3 years ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data.☆49Updated 5 years ago
- Text and Punctuation correction with Deep Learning☆129Updated 4 years ago
- TUFS Asian Language Parallel Corpus☆48Updated last year
- Improved Sentence Alignment in Linear Time and Space☆163Updated last year
- Arabic Dialect Identification on AOC data.☆23Updated 5 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆437Updated 7 months ago
- JFLEG (JHU FLuency-Extended GUG) corpus for Grammatical Error Correction Evaluation☆112Updated last year
- Crawler for linguistic corpora☆193Updated 11 months ago
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆179Updated 5 years ago
- ☆42Updated 6 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated last year
- Text to Speech with PyTorch (English and Mongolian)☆184Updated last month
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆11Updated 2 years ago
- Experimental project to punctuate text using a embedding layer, single convolutional layer and output softmax layer written in Keras.☆83Updated 4 years ago
- Монгол үгийн алдаа шалгах толь, Mongolian spellchecking dictionary☆37Updated this week
- Machine-Translation-based sentence alignment tool for parallel text☆300Updated 3 years ago
- Next Word Prediction using n-gram Probabilistic Model with various Smoothing Techniques☆35Updated 6 years ago
- GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package al…☆264Updated last year
- cLang-8 is a dataset for grammatical error correction.☆102Updated 2 years ago