Phone generation model/VAE/GAN/VAE+GAN
☆20Jun 26, 2018Updated 7 years ago
Alternatives and similar repositories for generative_model_speech
Users that are interested in generative_model_speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains the code to reproduce the core results from the paper "Learning Latent Representations for Speech Generation and…☆52Apr 16, 2018Updated 8 years ago
- Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020☆16Oct 20, 2020Updated 5 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- A difficulty-aware embedding of complementary deep networks for image classification☆13Jul 25, 2024Updated last year
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Rainbow Keywords - Official PyTorch Implementation☆14Jun 27, 2024Updated last year
- Speech enhancement using mimic loss☆16Oct 25, 2019Updated 6 years ago
- Keras implementation of speech enhancement based on LSGAN☆20Dec 10, 2017Updated 8 years ago
- The phoneme classification code for EUSIPCO 2017 paper: Timbre Analysis of Music Audio Signals with Convolutional Neural Networks☆21Mar 1, 2017Updated 9 years ago
- Mirror of GlottHMM☆10Jun 7, 2016Updated 9 years ago
- ☆22Jan 15, 2019Updated 7 years ago
- Download and create a tfreader for the audioset dataset☆17Apr 16, 2020Updated 6 years ago
- Remote sensing of vegetation and crops using hyperspectral imagery and unsupervised learning methods. The project contains different appl…☆14Dec 28, 2021Updated 4 years ago
- This repository not only contains experience about parameter finetune, but also other in-practice experience such as model ensemble (boos…☆16Oct 29, 2017Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Keras-based python framework to compute phonological posterior probabilities from audio files☆48Dec 27, 2022Updated 3 years ago
- A collection of examples demonstrating how we can build speech synthesis systems using nnmnkwii.☆71May 15, 2020Updated 6 years ago
- ☆14Oct 12, 2024Updated last year
- Scene Parsing via Integrated Classification Model and Variance-Based Regularization (Matlab&Caffe), In CVPR 2019☆11Jun 11, 2019Updated 6 years ago
- The code for aishell-3 baseline acoustic model☆69Nov 30, 2020Updated 5 years ago
- ☆13Aug 11, 2018Updated 7 years ago
- Classification-by-Components: Probabilistic Modeling of Reasoning over a Set of Components [NeurIPS 2019]☆13Mar 31, 2020Updated 6 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf☆32Jul 6, 2023Updated 2 years ago
- Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data☆11Oct 31, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing☆89Sep 6, 2024Updated last year
- Repository of NeurIPS 2019 paper "Calibration tests in multi-class classification: A unifying framework"☆17May 6, 2021Updated 5 years ago
- HSI Band Selection☆20Nov 5, 2019Updated 6 years ago
- speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN☆36Apr 16, 2018Updated 8 years ago
- ☆13Sep 26, 2023Updated 2 years ago
- coursework from classes at UW☆11May 14, 2019Updated 7 years ago
- The code implementation of our paper "Deep Hashing Neural Networks for Hyperspectral Image Feature Extraction", GRSL, 2019☆15Aug 20, 2021Updated 4 years ago
- ☆14Apr 11, 2024Updated 2 years ago
- Variational Autoencoder with Normalizing Flows☆17Jun 17, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- ☆12Mar 23, 2026Updated 2 months ago
- Info for prospective PhD students for Chris Donahue's lab at CMU starting Fall 23.☆12Nov 13, 2022Updated 3 years ago
- Fast parallel RNN-Transducer.☆10Nov 1, 2019Updated 6 years ago
- This repository contains the code to reproduce the core results from the paper "Unsupervised Learning of Disentangled and Interpretable R…☆155Jan 30, 2018Updated 8 years ago
- This script utilizes an open source SVM library for image classification and segmentation of hyperspectral image data☆12Dec 25, 2018Updated 7 years ago
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆12Aug 29, 2023Updated 2 years ago