Mihir3009 / whisper-to-speechLinks

The aim of this project is to make voice assistants more responsive towards whisper to some extent.

☆10

Alternatives and similar repositories for whisper-to-speech

Users that are interested in whisper-to-speech are comparing it to the libraries listed below

Sorting:

amphionspace / tts-evaluation
An evaluation set for large-scale trained TTS models (Coming in Sep 2024)
☆12Updated 10 months ago
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Updated last year
ryota-komatsu / speech_resynth
Speech Resynthesis and Language Modeling
☆20Updated last month
jisang93 / VISinger
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆15Updated 2 years ago
p1an-lin-jung / wv_tts
☆19Updated last year
lexkoro / StyleTTS
☆11Updated 2 years ago
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Updated 3 months ago
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18Updated last year
lexkoro / cfm-vc
☆11Updated 4 months ago
ZehuaKcrissLi / GTR-Voice
☆13Updated 8 months ago
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆21Updated last week
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Updated 11 months ago
idiap / zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆21Updated last year
cyhuang-tw / robust-vc
☆11Updated 3 years ago
shengcanxu / canoSpeech
text to speech
☆10Updated last year
asuni / PitchSqueezer
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆34Updated last year
PanagiotisP / svs-multiband
Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022
☆15Updated 3 years ago
lifeiteng / VoiceBox
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆27Updated last year
zengchang233 / CrossSinger
The source code for the paper CrossSinger (asru2023)
☆18Updated last year
karchkha / MelSpec_GPT_VQVAE
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Updated last year
Chengyuann / AutoStyle-TTS
Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis…
☆14Updated 4 months ago
b-sigpro / sed-hsmm
Onset-and-Offset-Aware Sound Event Detection
☆17Updated 5 months ago
ZhaoF-i / ASTWS-AEC
Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation
☆16Updated last week
ogunlao / glowtts_stdp
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆18Updated 2 years ago
alumae / torch-xvectors-wav
☆22Updated 4 years ago
thuhcsi / PortableTTS
☆12Updated 2 years ago
sarulab-speech / spatial_voice_conversion
Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
☆17Updated 11 months ago
BUTSpeechFIT / TS_SUPERB
☆15Updated 3 months ago
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆17Updated 8 months ago
AI-S2-Lab / GPT-Talker
[ACMMM'2024] Generative Expressive Conversational Speech Synthesis
☆36Updated 8 months ago