bayartsogt-ya / albert-mongolian
ALBERT trained on Mongolian text corpus
☆18Updated 4 years ago
Alternatives and similar repositories for albert-mongolian:
Users who are interested in albert-mongolian are comparing it to the repositories listed below
- Pytorch-Named-Entity-Recognition-with-BERT☆15Updated 4 years ago
- This repo contains a set of neural transducers, e.g. sequence-to-sequence models, focusing on character-level tasks.☆74Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- ☆42Updated 3 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 4 years ago
- Complementary code for our paper Automatic punctuation restoration with BERT models☆49Updated last year
- Pre-trained Mongolian BERT models☆46Updated 4 years ago
- A simple neural truecaser written in PyTorch and AllenNLP.☆33Updated 10 months ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Updated 4 years ago
- YiSi: A Semantic Machine Translation Evaluation Metric for Evaluating Languages with Different Levels of Available Resources☆25Updated 5 years ago
- Multilingual speech translation☆41Updated 4 years ago
- A PyTorch Implementation of Luna: Linear Unified Nested Attention☆41Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 5 years ago
- NTREX -- News Test References for MT Evaluation☆83Updated 10 months ago
- ☆34Updated 4 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Updated 3 years ago
- ☆92Updated last year
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- ☆44Updated 4 years ago
- ASR project with pytorch-lightning☆20Updated last month
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- A repository containing the details of a natural language inference dataset in Hindi☆11Updated 4 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- Add noise to your text; can be used to improve synthetic training corpora for Neural Machine Translation☆41Updated 5 years ago
- Scripts for finetuning m2m-100 models☆16Updated 2 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 4 years ago
- ☆12Updated 9 years ago
- LTG-Bert☆32Updated last year
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆27Updated 4 years ago