PAVOQUE Corpus of Expressive Speech
☆12Aug 2, 2016Updated 9 years ago
Alternatives and similar repositories for pavoque-data
Users that are interested in pavoque-data are comparing it to the libraries listed below
Sorting:
- Speech waveform synthesis filters☆13Jul 21, 2017Updated 8 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆10Nov 1, 2025Updated 4 months ago
- visual-text to speech☆14Apr 3, 2022Updated 3 years ago
- vad algorithm based on esp32 for mute detection☆13Dec 9, 2018Updated 7 years ago
- ☆16Sep 12, 2019Updated 6 years ago
- Instructions for reproducing the research described in the paper "Tempo Estimation for Music Loops and a Simple Confidence Measure"☆14Nov 18, 2016Updated 9 years ago
- NSynth for the rest of us☆14May 12, 2017Updated 8 years ago
- Efficient Speech Processing Tookit for Automatic Speaker Recognition☆17Feb 8, 2023Updated 3 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- Mel-Generalized Cepstrum analysis☆20Jul 21, 2017Updated 8 years ago
- Multiple Fundamental Frequency Estimation☆27Apr 7, 2014Updated 11 years ago
- generative models for speech☆20Jul 4, 2016Updated 9 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON☆18Feb 26, 2021Updated 5 years ago
- Implementations of differentiable stacks, queues, and deques from "Learning to Transduce with Unbounded Memory"☆20Sep 8, 2015Updated 10 years ago
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…☆28Nov 23, 2023Updated 2 years ago
- Automatic Differentiation for OpenCL.☆20Mar 4, 2015Updated 11 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆190Jan 26, 2026Updated last month
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 3 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)☆29Jul 9, 2024Updated last year
- Web app to annotate word onsets and offsets on spectrograms☆28Aug 12, 2022Updated 3 years ago
- ☆32Jul 27, 2022Updated 3 years ago
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆68Dec 13, 2021Updated 4 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- Implementation for "Rational Recurrences", Peng et al., EMNLP 2018.☆28Jun 21, 2022Updated 3 years ago
- In-the-wild deepfake detection dataset☆13Mar 5, 2025Updated 11 months ago
- ☆12Nov 9, 2015Updated 10 years ago
- Recursive Neural Tensor Networks☆11Feb 3, 2014Updated 12 years ago
- A Chainer implementation of WaveGlow.☆41Jan 26, 2019Updated 7 years ago
- Home Assistant integration for Hoymiles Cloud API, primarily developed for HYT inverters with battery storage systems. This integration p…☆14Updated this week
- Viterbi decoding in PyTorch☆41Sep 10, 2025Updated 5 months ago
- Library for research in audio analysis, processing and synthesis☆47Jun 22, 2016Updated 9 years ago
- STM32746G-DISCOVERY platform - GCC Makefile project templates and experiments☆11Dec 23, 2015Updated 10 years ago