Emotional Speech Conversion using Style Transfer and MUNIT
☆37Apr 17, 2019Updated 6 years ago
Alternatives and similar repositories for EmoMUNIT
Users that are interested in EmoMUNIT are comparing it to the libraries listed below
Sorting:
- Emotional Speech Conversion using Nonparallel Data☆17Apr 10, 2019Updated 6 years ago
- pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020☆30Jul 6, 2023Updated 2 years ago
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆87Dec 31, 2022Updated 3 years ago
- CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer☆35Feb 4, 2025Updated last year
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆90Nov 13, 2020Updated 5 years ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow.☆27Oct 30, 2018Updated 7 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Nov 6, 2020Updated 5 years ago
- This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…☆125Dec 14, 2020Updated 5 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- This repository contains code to replicate results from the ICASSP 2020 paper "StarGAN for Emotional Speech Conversion: Validated by Data…☆137Oct 24, 2021Updated 4 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- Multiple Transformation Function Estimation for Image Enhancement☆22Oct 20, 2024Updated last year
- CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)☆59Jul 26, 2022Updated 3 years ago
- Build an attention-based model for speech recogntion.Use the Word2vec model to help to train the attention model.☆29Dec 18, 2019Updated 6 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆248Mar 24, 2023Updated 2 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆400Sep 30, 2024Updated last year
- voice conversion system☆25Jun 10, 2020Updated 5 years ago
- Demo for 2022 Interspeech☆29Jun 14, 2022Updated 3 years ago
- The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems☆283Oct 10, 2023Updated 2 years ago
- ☆121Oct 24, 2022Updated 3 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆34Jun 13, 2018Updated 7 years ago
- Python library to forecast univariate time series through backtesting model selection☆23Jun 12, 2024Updated last year
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆148Jan 15, 2024Updated 2 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆317Aug 25, 2021Updated 4 years ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss☆1,092Oct 23, 2024Updated last year
- Voice Conversion by CycleGAN (语音克隆/语音转换):CycleGAN-VC3☆155May 5, 2022Updated 3 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Instagram Automation Tool is a framework that automates various Instagram tasks, including file-based operations and web automation (via …☆15May 4, 2025Updated 9 months ago
- Using pre-trained YOLO algorithm to detect faces in photo ID documents for ID verification☆10Apr 3, 2018Updated 7 years ago
- Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆42Mar 12, 2023Updated 2 years ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆50Mar 25, 2025Updated 11 months ago
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 5 months ago
- Implementation of Emo-StarGAN☆46Dec 19, 2023Updated 2 years ago
- A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'☆41Aug 1, 2018Updated 7 years ago
- Speech (audio) subjective evaluation system☆42Jul 15, 2020Updated 5 years ago
- TTS for pitch-accented language. Korean dialect DB.☆157May 12, 2023Updated 2 years ago
- CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion☆41Mar 2, 2020Updated 6 years ago
- Global Rhythm Style Transfer Without Text Transcriptions☆285Oct 23, 2024Updated last year
- Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model☆192Jul 30, 2024Updated last year