Interspeech 2019 tutorial materials
☆49Sep 26, 2019Updated 6 years ago
Alternatives and similar repositories for INTERSPEECH19_TUTORIAL
Users that are interested in INTERSPEECH19_TUTORIAL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- INTERSPEECH 2019 Tutorial Materials☆194Mar 30, 2021Updated 5 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- A fast cnn-based vocoder☆78Jun 11, 2020Updated 5 years ago
- Util code, issues, discussions☆29Aug 31, 2018Updated 7 years ago
- Voice Conversion Challenge 2020 CycleVAE baseline system☆131Oct 19, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Jul 25, 2024Updated last year
- Tools for Ahocoder data processing and evaluation metrics☆15Apr 22, 2024Updated last year
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Oct 27, 2020Updated 5 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition☆12Dec 8, 2015Updated 10 years ago
- ☆13Aug 11, 2018Updated 7 years ago
- A WaveRNN implementation☆201Oct 14, 2019Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Bias Tests for Voice Technologies (bt4vt)☆11Jun 16, 2024Updated last year
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- Library to build speech synthesis systems designed for easy and fast prototyping.☆399Jun 29, 2024Updated last year
- ☆53Dec 18, 2020Updated 5 years ago
- This is an implementation of the audio source separation model as well as the evaluation metrics proposed in the paper "Weakly Informed A…☆11Nov 26, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆22Jan 15, 2019Updated 7 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Mar 3, 2020Updated 6 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Multilingual Grapheme to Phoneme☆51Feb 23, 2016Updated 10 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 4 years ago
- ☆29May 4, 2020Updated 5 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Jun 24, 2019Updated 6 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Voice Conversion Tool Kit☆608Feb 27, 2023Updated 3 years ago
- Zero-data (yet trainable) probabilistic fundamental frequency estimator.☆19Jun 9, 2018Updated 7 years ago
- A Collection of Speech Corpus for ASR and TTS☆113Jun 19, 2017Updated 8 years ago
- ☆33Nov 7, 2019Updated 6 years ago
- Dataset and baseline for the first Audiocaption task☆79Jul 25, 2024Updated last year
- ☆76Mar 18, 2022Updated 4 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆33Feb 27, 2021Updated 5 years ago