minkjung / blankcollapseLinks

☆10

Alternatives and similar repositories for blankcollapse

Users that are interested in blankcollapse are comparing it to the libraries listed below

Sorting:

jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12Updated 4 years ago
KrishnaDN / BERTphone
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Updated 4 years ago
AppleHolic / FastSpeech2
Refactored version of https://github.com/ming024/FastSpeech2
☆14Updated 3 years ago
qiujiali / lattice-rescore
☆16Updated 3 years ago
Sreyan88 / LipGER
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆17Updated last year
naver / multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆27Updated last year
JSALT2022CodeSwitchingASR / generating-code-switched-audio
☆12Updated 5 months ago
speech-paper-reading / speech-paper-reading
Repository for speech paper reading
☆33Updated 3 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆21Updated last year
zldzmfoq12 / VCtube
A pakage for crawling audio from Youtube
☆42Updated last year
speechio / asr-noises
A handy dataset of noises for ASR
☆21Updated 6 years ago
ag1988 / mel-asr
The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…
☆19Updated 9 months ago
idiap / contextual-biasing-on-gpus
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆20Updated last year
nc-ai / speech
☆17Updated last month
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31Updated 3 years ago
skhu101 / Bayesian_TDNN
This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…
☆9Updated 3 years ago
amazon-science / proteno
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45Updated 4 years ago
sigmorphon / 2020
SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…
☆36Updated 2 months ago
wavlab-speech / cmu_multilingual_speech
CMU multilingual speech repository
☆31Updated 3 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Updated 3 years ago
hfutami / distill-bert-for-seq2seq-asr
☆24Updated 5 years ago
sooftware / speech-recognition-papers
Awesome Automatic Speech Recognition (ASR) paper collection
☆19Updated 4 years ago
speechnovateur / languagecodec_tmp
Temporary anonymous version
☆22Updated last year
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17Updated 9 months ago
idiap / icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17Updated 3 years ago
nervjack2 / Speech2Unit
☆13Updated 9 months ago
hlt-mt / Speech-MASSIVE
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆22Updated 10 months ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Updated 5 years ago
iamjanvijay / rnnt
An implementation of RNN-Transducer loss in TF-2.0.
☆45Updated 2 years ago
kamperh / vqwordseg
Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.
☆37Updated last year