rafaelvalle / asrgenView external linksLinks
Attacking Speaker Recognition with Deep Generative Models
☆34Mar 24, 2023Updated 2 years ago
Alternatives and similar repositories for asrgen
Users that are interested in asrgen are comparing it to the libraries listed below
Sorting:
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- Speech enhancement using mimic loss☆16Oct 25, 2019Updated 6 years ago
- Fast spectrogram phase recovery using Local Weighted Sums (C/Python/Matlab)☆117Nov 28, 2023Updated 2 years ago
- ☆42Oct 30, 2018Updated 7 years ago
- Analytic signal-based source information analysis for YANGstraight and real-time interactive tools☆34Aug 20, 2019Updated 6 years ago
- A fast cnn-based vocoder☆78Jun 11, 2020Updated 5 years ago
- A python package that make tensorflow be able to read "Kaldi" scp/ark in an elegant way. May kaldi user happy to enter tensorflow world.☆40Nov 26, 2018Updated 7 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Mar 31, 2019Updated 6 years ago
- Supplemental material for the paper "Towards Automatically Correcting Tapped Beat Annotations for Music Recordings"☆20May 6, 2021Updated 4 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆125Mar 29, 2019Updated 6 years ago
- Bayesian spEEch Recognizer☆55Jan 11, 2021Updated 5 years ago
- ☆18Feb 9, 2020Updated 6 years ago
- melodic object transcription framework☆26Nov 15, 2017Updated 8 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Dec 26, 2019Updated 6 years ago
- ☆56Aug 21, 2018Updated 7 years ago
- ☆24Oct 9, 2018Updated 7 years ago
- WaveGlow vocoder with VQVAE☆61Jun 18, 2019Updated 6 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)☆22Oct 14, 2017Updated 8 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- tts fronted-end☆11Dec 19, 2018Updated 7 years ago
- The repository for 'jipci', the Just Intonation Pansophical Conversion Instrument.☆10Dec 28, 2022Updated 3 years ago
- Generation tool for offset-resistant audio adversarial examples against Deepspeech☆10Oct 5, 2020Updated 5 years ago
- A Text2Speech Engine built in Pytorch.☆12Dec 9, 2018Updated 7 years ago
- ☆17Jul 29, 2018Updated 7 years ago
- ☆11May 4, 2020Updated 5 years ago
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Nov 23, 2018Updated 7 years ago
- Stochastic multi-stream sampling for iterative learning☆81Mar 12, 2024Updated last year
- Core code for my ICASSP 2018 paper☆53Jul 27, 2018Updated 7 years ago
- Speech Enhancement using Bayesian WaveNet☆98Apr 1, 2018Updated 7 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Wavelet phase harmonic scattering transform☆12Jul 5, 2022Updated 3 years ago
- A powerful tool to design any tensor factorization model and estimate the corresponding parameters☆12Sep 24, 2025Updated 4 months ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Apr 8, 2020Updated 5 years ago