Phone generation model/VAE/GAN/VAE+GAN
☆20Jun 26, 2018Updated 7 years ago
Alternatives and similar repositories for generative_model_speech
Users that are interested in generative_model_speech are comparing it to the libraries listed below
Sorting:
- This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and…☆52Apr 16, 2018Updated 7 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- Speech enhancement using mimic loss☆16Oct 25, 2019Updated 6 years ago
- Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020☆16Oct 20, 2020Updated 5 years ago
- Keras implementation of speech enhancement based on LSGAN☆20Dec 10, 2017Updated 8 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆46Dec 27, 2022Updated 3 years ago
- ☆22Jan 15, 2019Updated 7 years ago
- The phoneme classification code for EUSIPCO 2017 paper: Timbre Analysis of Music Audio Signals with Convolutional Neural Networks☆21Mar 1, 2017Updated 9 years ago
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech☆25Apr 20, 2022Updated 3 years ago
- Voice conversion tools for STRAIGHT☆29Jul 17, 2020Updated 5 years ago
- A collection of examples demonstrating how we can build speech synthesis systems using nnmnkwii.☆71May 15, 2020Updated 5 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆35Dec 31, 2023Updated 2 years ago
- The code for aishell-3 baseline acoustic model☆69Nov 30, 2020Updated 5 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf☆32Jul 6, 2023Updated 2 years ago
- Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing☆89Sep 6, 2024Updated last year
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data☆10Oct 31, 2018Updated 7 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆35Apr 25, 2018Updated 7 years ago
- ☆16Sep 28, 2024Updated last year
- Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver…☆10Jul 21, 2023Updated 2 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Mar 24, 2023Updated 2 years ago
- Speech (audio) subjective evaluation system☆42Jul 15, 2020Updated 5 years ago
- speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN☆35Apr 16, 2018Updated 7 years ago
- ☆17Oct 6, 2025Updated 4 months ago
- Minimal expected switch duration toolbox (MESD toolbox)☆11Apr 20, 2021Updated 4 years ago
- The GitHub repository contains the online system code derived from the 'visual tracking brain-computer interface' research. This code enc…☆11Jan 24, 2024Updated 2 years ago
- Official source code for Time is Not Enough: Time-Frequency based Explanation for Time-Series Black-Box Models☆12Dec 5, 2024Updated last year
- A Text2Speech Engine built in Pytorch.☆12Dec 9, 2018Updated 7 years ago
- ☆13Sep 5, 2023Updated 2 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- The code for "EEG-based Emotion Recognition Using Convolutional Neural Network with Functional Connections"☆13Dec 8, 2020Updated 5 years ago
- Stellenbosch University ZeroSpeech 2019 System☆10Apr 4, 2019Updated 6 years ago
- Mirror of GlottHMM☆10Jun 7, 2016Updated 9 years ago
- Official Implementation of Integrating Physics-Informed Vectors for Improved Wind Speed Forecasting with Neural Networks☆12Mar 24, 2025Updated 11 months ago
- Audio captioning recipe☆51Oct 23, 2025Updated 4 months ago
- Urdu Word Segmentation using Conditional Random Fields (CRFs)☆12Oct 3, 2018Updated 7 years ago
- Pretrained spoken language classifiers from audio.☆10Jan 21, 2021Updated 5 years ago