besacier / AMMIcourseLinks
☆43Updated 3 years ago
Alternatives and similar repositories for AMMIcourse
Users that are interested in AMMIcourse are comparing it to the libraries listed below
Sorting:
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆115Updated 6 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Updated 2 years ago
- A guide to building language technology in new languages.☆59Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆50Updated 2 years ago
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Updated 3 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated last year
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 7 months ago
- Scripts to create speech corpora from open.bible☆13Updated 3 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆56Updated 5 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆30Updated 2 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆35Updated 3 years ago
- Multilingual speech translation☆41Updated 4 years ago
- NTREX -- News Test References for MT Evaluation☆86Updated last year
- Repository for SLURP paper☆106Updated 3 years ago
- ☆56Updated 2 years ago
- Speech2vec pre-trained word vectors☆76Updated 7 years ago
- A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset☆13Updated 4 years ago
- ☆15Updated 6 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 3 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated 2 months ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆36Updated 4 months ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 4 years ago
- Agile reading group that works☆13Updated 3 years ago
- A curated list of research papers and resources on code-switching☆328Updated 11 months ago
- PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning☆229Updated 4 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆157Updated 5 years ago
- ☆51Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆30Updated 4 years ago
- Utilities for Processing the Switchboard Dialogue Act Corpus☆72Updated 4 years ago