besacier / AMMIcourseLinks
☆43Updated 3 years ago
Alternatives and similar repositories for AMMIcourse
Users that are interested in AMMIcourse are comparing it to the libraries listed below
Sorting:
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Updated 2 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆115Updated 6 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- A guide to building language technology in new languages.☆59Updated 3 years ago
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Updated 3 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Updated 3 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆31Updated 2 years ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆50Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆86Updated last year
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 8 months ago
- A program to choose transfer languages for cross-lingual learning☆73Updated 3 months ago
- Scripts to create speech corpora from open.bible☆13Updated 3 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆56Updated 5 years ago
- Agile reading group that works☆13Updated 3 years ago
- ☆50Updated last year
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated last year
- A curated list of research papers and resources on code-switching☆328Updated last year
- Repository for SLURP paper☆107Updated 3 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 3 years ago
- Utilities for Processing the Switchboard Dialogue Act Corpus☆72Updated 4 years ago
- ☆17Updated 3 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 3 years ago
- ☆56Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆30Updated 4 years ago
- Implementation of the GBST block from the Charformer paper, in Pytorch☆118Updated 4 years ago
- ☆45Updated 3 years ago
- xfspell — the Transformer Spell Checker☆189Updated 5 years ago
- Speech2vec pre-trained word vectors☆76Updated 7 years ago
- Multilingual speech translation☆41Updated 4 years ago
- Add noise to your text, can be used to improve synthetic training corpus for Neural Machine Translation☆41Updated 6 years ago