Phone generation model/VAE/GAN/VAE+GAN
☆20 · Jun 26, 2018 · Updated 7 years ago
Alternatives and similar repositories for generative_model_speech
Users that are interested in generative_model_speech are comparing it to the libraries listed below.
- This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and… ☆52 · Apr 16, 2018 · Updated 7 years ago
- Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020 ☆16 · Oct 20, 2020 · Updated 5 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit ☆13 · Nov 18, 2022 · Updated 3 years ago
- using world vocoder to extract features and make data for training neural networks ☆11 · Oct 9, 2017 · Updated 8 years ago
- Rainbow Keywords - Official PyTorch Implementation ☆14 · Jun 27, 2024 · Updated last year
- Speech enhancement using mimic loss ☆16 · Oct 25, 2019 · Updated 6 years ago
- Keras implementation of speech enhancement based on LSGAN ☆20 · Dec 10, 2017 · Updated 8 years ago
- The phoneme classification code for EUSIPCO 2017 paper: Timbre Analysis of Music Audio Signals with Convolutional Neural Networks ☆21 · Mar 1, 2017 · Updated 9 years ago
- Mirror of GlottHMM ☆10 · Jun 7, 2016 · Updated 9 years ago
- ☆22 · Jan 15, 2019 · Updated 7 years ago
- Download and create a tfreader for the audioset dataset ☆16 · Apr 16, 2020 · Updated 5 years ago
- This repository not only contains experience about parameter finetune, but also other in-practice experience such as model ensemble (boos… ☆16 · Oct 29, 2017 · Updated 8 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files ☆46 · Dec 27, 2022 · Updated 3 years ago
- A collection of examples demonstrating how we can build speech synthesis systems using nnmnkwii. ☆71 · May 15, 2020 · Updated 5 years ago
- ☆14 · Oct 12, 2024 · Updated last year
- The code for aishell-3 baseline acoustic model ☆69 · Nov 30, 2020 · Updated 5 years ago
- ☆13 · Aug 11, 2018 · Updated 7 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf ☆32 · Jul 6, 2023 · Updated 2 years ago
- Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data ☆10 · Oct 31, 2018 · Updated 7 years ago
- Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing ☆89 · Sep 6, 2024 · Updated last year
- speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN ☆36 · Apr 16, 2018 · Updated 7 years ago
- coursework from classes at UW ☆12 · May 14, 2019 · Updated 6 years ago
- ☆13 · Sep 26, 2023 · Updated 2 years ago
- ☆14 · Apr 11, 2024 · Updated 2 years ago
- Variational Autoencoder with Normalizing Flows ☆17 · Jun 17, 2017 · Updated 8 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based … ☆16 · Sep 5, 2017 · Updated 8 years ago
- Info for prospective PhD students for Chris Donahue's lab at CMU starting Fall 23. ☆12 · Nov 13, 2022 · Updated 3 years ago
- Fast parallel RNN-Transducer. ☆10 · Nov 1, 2019 · Updated 6 years ago
- ☆11 · Mar 23, 2026 · Updated 3 weeks ago
- This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable R… ☆155 · Jan 30, 2018 · Updated 8 years ago
- The official code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation" ☆12 · Aug 29, 2023 · Updated 2 years ago
- Audio captioning recipe ☆52 · Oct 23, 2025 · Updated 5 months ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A… ☆12 · Jan 24, 2024 · Updated 2 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications ☆12 · Oct 27, 2021 · Updated 4 years ago
- This repository contains supplementary material for the paper: "Audio Source Separation Using Variational Autoencoders and Weak Class Sup… ☆11 · Jan 10, 2023 · Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration ☆34 · Sep 24, 2021 · Updated 4 years ago
- Integration guide for FFASR, the far-field (single-microphone) speech recognition engine from 语智科技 ☆15 · Aug 4, 2023 · Updated 2 years ago
- ☆10 · Jan 26, 2021 · Updated 5 years ago
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech ☆25 · Apr 20, 2022 · Updated 3 years ago