rhoposit/icassp2021

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/rhoposit/icassp2021)

rhoposit / icassp2021

☆15

Alternatives and similar repositories for icassp2021

Users that are interested in icassp2021 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rhoposit / multilingual_VQVAE
View on GitHub
☆37May 8, 2021Updated 5 years ago
nii-yamagishilab / Extended_VQVAE
View on GitHub
☆64Aug 14, 2023Updated 2 years ago
ZackHodari / average_prosody
View on GitHub
Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…
☆24Dec 8, 2019Updated 6 years ago
meelement / noise_adversarial_tacotron
View on GitHub
Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…
☆17Aug 15, 2019Updated 6 years ago
mutiann / speech_rankings
View on GitHub
A CSRankings-like index for speech researchers
☆35Oct 16, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
rgzn-aiyun / melgan-cpu
View on GitHub
Real-time melgan based on cpu ！！！
☆13Dec 3, 2019Updated 6 years ago
thuhcsi / icassp2021-emotion-tts
View on GitHub
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
☆34Mar 17, 2023Updated 3 years ago
sarulab-speech / lightweight_spkr_anon
View on GitHub
Lightweight speaker anonymization [IEEE SLT2021]
☆27Jun 6, 2022Updated 4 years ago
Takaaki-Saeki / ssl_speech_restoration
View on GitHub
SelfRemaster: SSL Speech Restoration
☆94Jan 5, 2024Updated 2 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
nc-ai / speech
View on GitHub
☆17Aug 27, 2025Updated 11 months ago
YoungSeng / SRD-VC
View on GitHub
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆119Feb 7, 2024Updated 2 years ago
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
rgzn-aiyun / tacotron2-melgan
View on GitHub
Mel spectrum based on tacotron2 for melgan speech synthesis
☆15Mar 24, 2023Updated 3 years ago
LeoniusChen / Attentions-in-Tacotron
View on GitHub
☆69Mar 31, 2021Updated 5 years ago
WelkinYang / WaveODE
View on GitHub
An ODE-based generative neural vocoder using Rectified Flow
☆58Apr 29, 2023Updated 3 years ago
bshall / VectorQuantizedCPC
View on GitHub
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
☆142Sep 1, 2020Updated 5 years ago
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
Yangyangii / AdvDCTTS
View on GitHub
Implementation of DCTTS with Adversarial Training
☆12Dec 30, 2019Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
jasonppy / FaST-VGS-Family
View on GitHub
Transformer-based visually grounded speech models
☆19Sep 22, 2022Updated 3 years ago
xcmyz / Lifelong-Learning-Tacotron2
View on GitHub
MultiSpeaker Tacotron2 using LifeLong Learning.
☆13Sep 27, 2019Updated 6 years ago
rishikksh20 / PPSpeech
View on GitHub
PPSpeech: Phrase based Parallel End-to-End TTS System
☆35Aug 31, 2020Updated 5 years ago
ttslr / MonTTS
View on GitHub
☆16Dec 23, 2021Updated 4 years ago
xinshengwang / ICASSP2021_paper_list-VC
View on GitHub
ICASSP 2021 accepted papers in term of voice conversion (VC)
☆18Apr 11, 2021Updated 5 years ago
adasegroup / OSM-one-shot-multispeaker
View on GitHub
Framework for one-shot multispeaker system based on Deep Learning
☆19May 30, 2021Updated 5 years ago
dafyddg / RFA
View on GitHub
Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…
☆17Apr 27, 2023Updated 3 years ago
CSTR-Edinburgh / ophelia
View on GitHub
Sequence-to-sequence TTS based on Kyubyong's dc_tts
☆61Feb 2, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
keonlee9420 / VAENAR-TTS
View on GitHub
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
☆74Aug 3, 2021Updated 4 years ago
MingjieChen / VoiceConversionGANs
View on GitHub
GAN series for voice conversion on VCC2018 dataset
☆17Aug 27, 2020Updated 5 years ago
CMsmartvoice / Unet-TTS
View on GitHub
One-shot TTS with Improved Unseen Speaker and Style Transfer
☆37Mar 2, 2022Updated 4 years ago
iamyuanchung / Autoregressive-Predictive-Coding
View on GitHub
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
☆191Jan 29, 2020Updated 6 years ago
nii-yamagishilab / VCC2020-database
View on GitHub
☆53Dec 18, 2020Updated 5 years ago
shang0712 / HierTTS
View on GitHub
☆47Apr 16, 2023Updated 3 years ago
facebookresearch / speech-resynthesis
View on GitHub
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…
☆416Aug 29, 2023Updated 2 years ago