besacier / AMMIcourse
☆43Updated 3 years ago
Alternatives and similar repositories for AMMIcourse
Users that are interested in AMMIcourse are comparing it to the libraries listed below
- This repo contains a set of neural transducers, e.g. sequence-to-sequence models, focusing on character-level tasks.☆76Updated 2 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 6 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- A guide to building language technology in new languages.☆59Updated 3 years ago
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Updated 3 years ago
- Creating super-parallel corpora of more than 1,500 unique languages for NLP research☆34Updated 2 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated last year
- NTREX -- News Test References for MT Evaluation☆85Updated last year
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆27Updated 2 years ago
- Repository containing an experimentation platform for training and running inference on wav2vec2 models.☆87Updated 3 years ago
- This code provides a word-level language identification tool for identifying the language of individual words in code-mixed text, e.g. The tex…☆55Updated 5 years ago
- Complementary code for our paper Automatic punctuation restoration with BERT models☆50Updated last year
- A program to choose transfer languages for cross-lingual learning☆72Updated 2 years ago
- Gamma Agreement in Python☆45Updated last year
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 4 years ago
- Scripts to create speech corpora from open.bible☆13Updated 3 years ago
- Utilities for Processing the Switchboard Dialogue Act Corpus☆70Updated 4 years ago
- ☆45Updated 3 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 4 months ago
- ☆56Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Multilingual speech translation☆41Updated 4 years ago
- This is an ASR corpus for the Bemba language. It contains read speech from diverse publicly available Bemba sources: Literature Books, Radio/…☆35Updated last month
- A curated list of research papers and resources on code-switching☆324Updated 9 months ago
- Speech2vec pre-trained word vectors☆76Updated 7 years ago
- Agile reading group that works☆13Updated 3 years ago
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆111Updated last year
- Repository for SLURP paper☆106Updated 3 years ago
- xfspell — the Transformer Spell Checker☆190Updated 5 years ago
- Build a dialog dataset from online books in many languages☆76Updated 2 years ago