kan-bayashi / INTERSPEECH19_TUTORIALView external linksLinks
Interspeech 2019 tutorial materials
☆49Sep 26, 2019Updated 6 years ago
Alternatives and similar repositories for INTERSPEECH19_TUTORIAL
Users that are interested in INTERSPEECH19_TUTORIAL are comparing it to the libraries listed below
Sorting:
- Util code, issues, discussions☆29Aug 31, 2018Updated 7 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Oct 27, 2020Updated 5 years ago
- Tools for Ahocoder data processing and evaluation metrics☆15Apr 22, 2024Updated last year
- A fast cnn-based vocoder☆78Jun 11, 2020Updated 5 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Jul 25, 2024Updated last year
- Voice Conversion Challenge 2020 CycleVAE baseline system☆131Oct 19, 2020Updated 5 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- INTERSPEECH 2019 Tutorial Materials☆194Mar 30, 2021Updated 4 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- Zero-data (yet trainable) probabilistic fundamental frequency estimator.☆19Jun 9, 2018Updated 7 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆31May 14, 2024Updated last year
- ☆76Mar 18, 2022Updated 3 years ago
- Dataset and baseline for the first Audiocaption task☆79Jul 25, 2024Updated last year
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Addressing the confounds of accompaniments in singer identification☆18Mar 24, 2020Updated 5 years ago
- Tool to aid in the creation of mashups☆19Apr 7, 2020Updated 5 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- ☆74Apr 4, 2024Updated last year
- This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition☆12Dec 8, 2015Updated 10 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Quasi-Periodic WaveNet Pytorch implementation☆13Mar 27, 2021Updated 4 years ago
- NNSVS向けの教師データのラベル作成支援ツールです。☆10Apr 5, 2023Updated 2 years ago
- Library to build speech synthesis systems designed for easy and fast prototyping.☆398Jun 29, 2024Updated last year
- speech engine training projects☆29Apr 19, 2021Updated 4 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- Python wrapper for Sinsy☆53Oct 9, 2023Updated 2 years ago
- ☆22Jan 15, 2019Updated 7 years ago
- Multilingual Grapheme to Phoneme☆51Feb 23, 2016Updated 9 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Jun 24, 2019Updated 6 years ago