mzboito / IWSLT2022_Tamasheq_dataLinks

Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWSLT2022.

☆18

Alternatives and similar repositories for IWSLT2022_Tamasheq_data

Users that are interested in IWSLT2022_Tamasheq_data are comparing it to the libraries listed below

Sorting:

xinjli / phonepiece
phone inventory library
☆17Updated 2 years ago
dan-wells / kiss-aligner
Simple Kaldi recipe for forced alignment
☆11Updated 2 years ago
xinjli / ucla-phonetic-corpus
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
☆45Updated 2 years ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆15Updated 3 years ago
MiniXC / phones
A collection of utilities for handling IPA phones.
☆26Updated 2 years ago
sigmorphon / 2020
SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…
☆36Updated 7 months ago
besacier / mboshi-french-parallel-corpus
☆22Updated 3 years ago
aalto-speech / subword-kaldi
Properly handle position-dependent phones in a subword lexicon FST
☆31Updated 5 years ago
NickRuiz / power-asr
Phonetically-Oriented Word Error Rate
☆36Updated 6 years ago
idiap / icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17Updated 4 years ago
Open-Speech-EkStep / data-acquisition-pipeline
☆17Updated 4 years ago
speechio / asr-noises
A handy dataset of noises for ASR
☆22Updated 6 years ago
amazon-science / proteno
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45Updated 4 years ago
fabianoluzbr / neural-g2p-portuguese
Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…
☆19Updated 4 years ago
m-wiesner / nnet_pytorch
Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.
☆26Updated last year
LeBenchmark / Interspeech2021
This repository describes our reproducible framework for assessing self-supervised representation learning from speech
☆51Updated 4 years ago
tiro-is / tiro-speech-core
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Updated 2 years ago
ccoreilly / deepspeech-catala
Deepspeech ASR Model for the Catalan Language
☆17Updated 4 years ago
qiujiali / lattice-rescore
☆16Updated 3 years ago
pilot7747 / VoxDIY
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16Updated 4 years ago
aixplain / NoRefER
☆17Updated last year
mt-upc / SHAS
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
☆40Updated 2 years ago
pzelasko / kaldialign
Python wrappers for Kaldi Levenshtein's distance and alignment code.
☆68Updated 6 months ago
burrmill / burrmill
BurrMill core
☆22Updated 4 years ago
xinjli / asr2k
asr2k
☆52Updated last year
RuABraun / texterrors
☆37Updated 2 weeks ago
qcri / e-wer
Word Error Rate Estimation
☆15Updated 5 years ago
revdotcom / words2num
Convert words to numbers
☆21Updated 3 years ago
BUTSpeechFIT / hystoc
Getting confidences from any end-to-end systems
☆11Updated 2 years ago
CUNY-CL / wikipron-modeling
Proposed splits for the LREC Wikipron paper
☆15Updated 5 years ago