tugstugi / mongolian-bert
Pre-trained Mongolian BERT models
☆43 Updated 3 years ago
Related projects
Alternatives and complementary repositories for mongolian-bert
- Useful resources for Mongolian NLP ☆172 Updated last year
- Cyrillic Mongolian text classification with tensorflow 2, and also some fine-tuning on TugsTugi's Mongolian BERT model and other NLP expe… ☆32 Updated last year
- The Mongolian Wordnet (MonWN) ☆17 Updated 2 years ago
- Mongolian speech recognition with PyTorch ☆129 Updated 3 years ago
- Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google… ☆17 Updated 3 months ago
- Pytorch-Named-Entity-Recognition-with-BERT ☆15 Updated 4 years ago
- Text to Speech with PyTorch (English and Mongolian) ☆184 Updated last month
- This repo contains a set of neural transducers, e.g. sequence-to-sequence models, focusing on character-level tasks. ☆72 Updated last year
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation ☆24 Updated last year
- SOTA punctuation restoration (e.g. for automatic speech recognition) deep learning model based on a pre-trained BERT model ☆179 Updated 5 years ago
- ALBERT trained on Mongolian text corpus ☆18 Updated 3 years ago
- An adaptation of Fairseq to (End-to-end) speech translation. ☆22 Updated 2 years ago
- ☆41 Updated last year
- Improving Disfluency Detection by Self-Training a Self-Attentive Model ☆47 Updated 3 years ago
- Text and Punctuation correction with Deep Learning ☆129 Updated 4 years ago
- An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models. ☆103 Updated 3 years ago
- cLang-8 is a dataset for grammatical error correction. ☆102 Updated 2 years ago
- Support tools for punctuation and boundary detection for ASR output. ☆57 Updated last year
- Complementary code for our paper Automatic punctuation restoration with BERT models ☆48 Updated last year
- Improved Sentence Alignment in Linear Time and Space ☆163 Updated last year
- A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT ☆25 Updated 3 years ago
- ☆33 Updated 3 years ago
- A web application that interfaces two GEC systems. [web instance is down] ☆31 Updated 3 months ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming. ☆43 Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆36 Updated last year
- Zero-shot Transfer Learning from English to Arabic ☆29 Updated 2 years ago
- Automatic Dialect Detection Repository ☆39 Updated 2 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou… ☆61 Updated 4 years ago
- Universal Romanizer that can convert any Unicode script to Roman (Latin) script ☆150 Updated 3 months ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2 ☆112 Updated 5 years ago