kevobt / speech-to-text-voxforgeLinks

Downloader for the voxforge corpus

☆8

Alternatives and similar repositories for speech-to-text-voxforge

Users that are interested in speech-to-text-voxforge are comparing it to the libraries listed below

Sorting:

LeBenchmark / Interspeech2021
This repository describes our reproducible framework for assessing self-supervised representation learning from speech
☆51Updated 3 years ago
MarkWuNLP / SemanticMask
The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"
☆38Updated 4 years ago
csukuangfj / transducer-loss-benchmarking
☆68Updated 3 years ago
iamjanvijay / rnnt_decoder_cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
☆68Updated 4 years ago
asappresearch / multistream-cnn
Multistream CNN for Robust Acoustic Modeling
☆40Updated 3 years ago
TParcollet / E2E-SincNet
E2E-SincNet: Toward fully end-to-end speech recognition
☆30Updated 5 years ago
vvestman / pytorch-ivectors
GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…
☆64Updated 5 years ago
mt-upc / SHAS
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
☆38Updated 2 years ago
jtrmal / kaldi2020
☆27Updated 4 years ago
wavlab-speech / cmu_multilingual_speech
CMU multilingual speech repository
☆31Updated 3 years ago
asappresearch / slue-toolkit
A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…
☆65Updated last year
idiap / icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17Updated 3 years ago
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48Updated 5 months ago
Chia-Hsuan-Lee / Spoken-SQuAD
A spoken question answering dataset on SQUAD
☆49Updated last month
gonenhila / codeswitching-lm
Language Modeling for Code-Switching
☆9Updated 5 years ago
dcaulley / av_diarization
AudioVisual Diarization - Supervised and Unsupervised
☆14Updated 2 years ago
lorenlugosch / transducer-tutorial
Example code for a neural transducer model.
☆61Updated last year
aalto-speech / subword-kaldi
Properly handle position-dependent phones in a subword lexicon FST
☆31Updated 4 years ago
thu-spmi / ASR-Benchmarks
An effort to track benchmarking results over widely-used datasets for ASR.
☆46Updated 3 years ago
k2-fsa / fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆141Updated last year
NickRuiz / power-asr
Phonetically-Oriented Word Error Rate
☆35Updated 6 years ago
awslabs / speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆103Updated 2 years ago
Dannynis / xvector_pytorch
A pytorch implementation of xvector embedding
☆79Updated 5 years ago
cornerfarmer / ctc_segmentation
Segment a given audio into utterances using a trained end-to-end ASR model.
☆73Updated 4 years ago
shane-settle / neural-acoustic-word-embeddings
☆45Updated 6 years ago
zerospeech / zerospeech2021_baseline
BERT and LSTM baseline models of the ZeroSpeech Challenge 2021
☆60Updated 2 years ago
sonos / spoken-language-understanding-research-datasets
☆49Updated 3 years ago
FlorianKrey / DNC
Discriminative Neural Clustering for Speaker Diarisation
☆78Updated 3 years ago
hainan-xv / PASM
Pronunciation-assisted Subword Modeling
☆29Updated 6 years ago
grtzsohalf / SpeechNet-codebase
☆20Updated 4 years ago