A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis
☆23Dec 31, 2021Updated 4 years ago
Alternatives and similar repositories for IdiapTTS
Users that are interested in IdiapTTS are comparing it to the libraries listed below
Sorting:
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- ☆17Oct 16, 2018Updated 7 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- ☆15Jul 11, 2022Updated 3 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆22Jan 18, 2023Updated 3 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Dec 8, 2022Updated 3 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Hybrid speech synthesiser☆28Feb 18, 2019Updated 7 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Mirror of GlottHMM☆10Jun 7, 2016Updated 9 years ago
- Database of annotated field recording samples that can be used for training audio labelling algorithms☆10Feb 1, 2019Updated 7 years ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021☆11Jun 13, 2021Updated 4 years ago
- ☆13Nov 22, 2022Updated 3 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- VoxAngeles Corpus☆13Aug 23, 2025Updated 6 months ago
- ☆10Mar 20, 2021Updated 4 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- ☆22Apr 8, 2022Updated 3 years ago
- Streaming Audiotransformers for online Audio tagging☆52Jun 14, 2024Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆48Nov 4, 2020Updated 5 years ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆13Jul 25, 2022Updated 3 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 8 months ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Spiking neural networks (SNNs) for speech classification☆12Mar 14, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year