JSALT2022CodeSwitchingASR / generating-code-switched-audioLinks

☆12

Alternatives and similar repositories for generating-code-switched-audio

Users that are interested in generating-code-switched-audio are comparing it to the libraries listed below

Sorting:

pkufool / simple-wer
A simple command line tool to calculate WER for ASR.
☆14Updated 9 months ago
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17Updated 9 months ago
speechio / asr-noises
A handy dataset of noises for ASR
☆21Updated 6 years ago
KrishnaDN / BERTphone
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Updated 4 years ago
skhu101 / Bayesian_TDNN
This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…
☆9Updated 3 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Updated 3 years ago
idiap / zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆21Updated last year
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Updated last year
yuhangear / wenet-android
☆12Updated 3 years ago
csalt-research / accented-codebooks-asr
☆18Updated 10 months ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆15Updated 3 years ago
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆14Updated last year
naver / multilingual-distilwhisper
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆27Updated last year
ductuantruong / speaker_age_estimation_ssl_study
Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Updated 2 years ago
cpii-cai / PunCantonese
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆14Updated 7 months ago
qiujiali / lattice-rescore
☆16Updated 3 years ago
alumae / streaming-punctuator
☆17Updated 2 years ago
nervjack2 / Speech2Unit
☆13Updated 9 months ago
ttslr / MonTTS
☆13Updated 3 years ago
chaufanglin / Normal2Whisper
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆10Updated 8 months ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31Updated 3 years ago
leohuang2013 / pyannote-audio_overlapped-speech-detection_cpp
C++ version of pyannote audio overlapped speech detection pipeline
☆13Updated last year
ashi-ta / speechGLUE
SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.
☆13Updated 2 years ago
reppy4620 / x-vits
☆13Updated 8 months ago
Mu-Y / DiariST
☆19Updated last year
hmohebbi / disentangling_representations
☆12Updated 9 months ago
WangHelin1997 / Aty-TTS
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆10Updated 2 months ago
p1an-lin-jung / wv_tts
☆19Updated last year
utter-project / mHuBERT-147-scripts
Collection of scripts from mHuBERT-147.
☆29Updated 7 months ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Updated 3 years ago