Maitreyapatel/speech-conversion-between-different-modalities

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Maitreyapatel/speech-conversion-between-different-modalities)

Maitreyapatel / speech-conversion-between-different-modalities

Generative Adversarial Networks for different impaired speech conversions

☆39

Alternatives and similar repositories for speech-conversion-between-different-modalities

Users that are interested in speech-conversion-between-different-modalities are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dwgnr / speech-conversion
View on GitHub
Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE
☆15Dec 3, 2022Updated 3 years ago
uark-cviu / Right2Talk
View on GitHub
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20Aug 2, 2021Updated 4 years ago
kamepong / ConvS2S-VC
View on GitHub
☆28Dec 14, 2021Updated 4 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
WangHelin1997 / DuTa-VC
View on GitHub
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆38Dec 5, 2023Updated 2 years ago
k2kobayashi / crank
View on GitHub
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
☆171Jul 25, 2024Updated last year
dmse4tts / DMSE4TTS
View on GitHub
☆24May 6, 2025Updated last year
rkmt / wesper-demo
View on GitHub
☆36Dec 25, 2023Updated 2 years ago
chaufanglin / Normal2Whisper
View on GitHub
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14Oct 31, 2024Updated last year
cyhuang-tw / robust-vc
View on GitHub
☆11May 7, 2022Updated 4 years ago
keonlee9420 / WaveGrad2
View on GitHub
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
☆68Aug 3, 2021Updated 4 years ago
thuhcsi / SnakeGAN
View on GitHub
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37Apr 25, 2023Updated 3 years ago
jayneelparekh / sp2si-code
View on GitHub
Contains code for our work on speech to singing conversion (ICASSP 2020)
☆50Oct 27, 2020Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ttslr / MonTTS
View on GitHub
☆16Dec 23, 2021Updated 4 years ago
zengchang233 / CrossSinger
View on GitHub
The source code for the paper CrossSinger (asru2023)
☆18Oct 12, 2023Updated 2 years ago
multimedia-berkeley / deep_hashing_coverSongDetection
View on GitHub
Cover Song Detection System
☆10Mar 29, 2019Updated 7 years ago
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
ASLP-lab / OmniCodec
View on GitHub
OmniCodec: Low Frame Rate Universal Audio Codec with Semantic–Acoustic Disentanglement
☆46Apr 17, 2026Updated 3 months ago
liusongxiang / ppg-vc
View on GitHub
PPG-Based Voice Conversion
☆348Jul 22, 2022Updated 4 years ago
wenet-e2e / WeTextProcessing.deprecated
View on GitHub
☆61Jan 31, 2023Updated 3 years ago
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
exercise-book-yq / Supercodec
View on GitHub
☆51Mar 5, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yoyolicoris / spectrogram-inversion
View on GitHub
spectrogram inversion tools in PyTorch. Documentation: https://spectrogram-inversion.readthedocs.io
☆51Jun 12, 2025Updated last year
Miffyli / asv-cm-reinforce
View on GitHub
Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE
☆13Mar 31, 2021Updated 5 years ago
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 4 years ago
MingjieChen / VoiceConversionGANs
View on GitHub
GAN series for voice conversion on VCC2018 dataset
☆17Aug 27, 2020Updated 5 years ago
RF5 / simple-asgan
View on GitHub
Training code and trained checkpoints for ASGAN.
☆62Dec 27, 2023Updated 2 years ago
TTS-Research / PEL-TTS
View on GitHub
☆14Aug 16, 2023Updated 2 years ago
inconnu11 / Objective-evaluation_speech_synthesis
View on GitHub
☆17Mar 24, 2022Updated 4 years ago
JusperLee / Look2hear
View on GitHub
A toolkit for researchers in the multimodal sound separation.
☆16Oct 20, 2023Updated 2 years ago
fgnt / pb_chime5
View on GitHub
Speech enhancement system for the CHiME-5 dinner party scenario
☆111Feb 6, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
shincling / discreteSeparation
View on GitHub
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12Oct 25, 2021Updated 4 years ago
ws-choi / LASAFT-Net-v2
View on GitHub
A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"
☆33Apr 11, 2022Updated 4 years ago
KunZhou9646 / emotional-voice-conversion-with-CycleGAN-and-CWT-for-Spectrum-and-F0
View on GitHub
This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…
☆124Dec 14, 2020Updated 5 years ago
frankenliu / LOAE
View on GitHub
☆10Sep 25, 2024Updated last year
wenet-e2e / wecut
View on GitHub
video cut powered by AI
☆23Nov 15, 2022Updated 3 years ago
fchest / Speech-Transformer-multi-GPUs
View on GitHub
A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…
☆10Dec 25, 2019Updated 6 years ago