besacier / AMMIcourse
☆42Updated 3 years ago
Alternatives and similar repositories for AMMIcourse:
Users who are interested in AMMIcourse are comparing it to the repositories listed below
- This repo contains a set of neural transducers, e.g. sequence-to-sequence models, focusing on character-level tasks.☆75Updated last year
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 5 years ago
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Updated 2 years ago
- A guide to building language technology in new languages.☆58Updated 3 years ago
- Scripts to create speech corpora from open.bible☆13Updated 3 years ago
- Complementary code for our paper Automatic punctuation restoration with BERT models☆49Updated last year
- NTREX -- News Test References for MT Evaluation☆83Updated 11 months ago
- Creating super-parallel corpora of more than 1500 unique languages for NLP research☆33Updated 2 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- This is an ASR corpus for the Bemba language. It contains read speech from diverse publicly available Bemba sources: Literature Books, Radio/…☆35Updated 4 months ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆49Updated 7 months ago
- A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of the English Fisher dataset☆12Updated 4 years ago
- Multilingual speech translation☆41Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆25Updated 2 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated last week
- A merged version of multiple open-source German speech datasets.☆31Updated last year
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆105Updated last year
- Low-Resource Neural Machine Translation: A Benchmark for Five African Languages☆15Updated 4 years ago
- Agile reading group that works☆13Updated 3 years ago
- ☆56Updated 2 years ago
- Gamma Agreement in Python☆43Updated last year
- Tool to fix bitexts and tag near-duplicates for removal☆30Updated 3 months ago
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 4 years ago
- ☆47Updated 9 months ago
- Morfessor EM+Prune☆10Updated 4 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago