idiap/IdiapTTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/idiap/IdiapTTS)

idiap / IdiapTTS

A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis

☆23

Alternatives and similar repositories for IdiapTTS

Users that are interested in IdiapTTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

speech-utcluj / thetaOscillator-syllable-segmentation
View on GitHub
Oscillator-based speech syllabification algorithm
☆11Sep 27, 2019Updated 6 years ago
neulab / AfricanVoices
View on GitHub
Hosts text-to-speech corpus and speech synthesizers for African languages.
☆19May 31, 2023Updated 3 years ago
idiap / inv-tn
View on GitHub
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
☆21Sep 27, 2017Updated 8 years ago
idiap / libssp
View on GitHub
Speech Signal Processing - C++ port of a subset of the Python library SSP
☆17Dec 24, 2020Updated 5 years ago
WelkinYang / EMPHASIS-pytorch
View on GitHub
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System
☆15Mar 31, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
idiap / asrt
View on GitHub
Various scripts that facilitate the preparation of Automatic Speech Recognition related resources
☆17Apr 16, 2020Updated 6 years ago
dr-pato / SSGD
View on GitHub
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆15Dec 22, 2022Updated 3 years ago
candlewill / RawNet
View on GitHub
RawNet: Fast End-to-End Neural Vocoder
☆43May 29, 2019Updated 7 years ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
michaelneri / unsupervised-audio-anomaly-detection
View on GitHub
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …
☆11Nov 6, 2024Updated last year
kimsunwiub / BLOOM-Net
View on GitHub
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
☆14Feb 13, 2022Updated 4 years ago
hhguo / WaveRNN
View on GitHub
Based on https://github.com/fatchord/WaveRNN
☆24May 3, 2020Updated 6 years ago
haoheliu / ontology-aware-audio-tagging
View on GitHub
☆14Nov 22, 2022Updated 3 years ago
RicherMans / SAT
View on GitHub
Streaming Audiotransformers for online Audio tagging
☆57Jun 14, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
idiap / sparch
View on GitHub
PyTorch based toolkit for developing spiking neural networks (SNNs) by training and testing them on speech command recognition tasks
☆32May 3, 2024Updated 2 years ago
yongxuUSTC / mtmvdr
View on GitHub
Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020
☆16Oct 20, 2020Updated 5 years ago
Koziev / StressModel
View on GitHub
Neural model for prediction of stress position in Russian words
☆13Jun 22, 2025Updated last year
TTS-Research / PEL-TTS
View on GitHub
☆14Aug 16, 2023Updated 2 years ago
haoheliu / DCASE_2022_Task_5
View on GitHub
System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection
☆28Jul 6, 2022Updated 4 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
View on GitHub
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37Oct 28, 2019Updated 6 years ago
gusrud1103 / LibriPhrase
View on GitHub
Recipe for LibriPhrase
☆38Sep 2, 2023Updated 2 years ago
stephengrice / synth-me
View on GitHub
Basic concatenative text-to-speech implementation in Python
☆20Aug 31, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ljuvela / GELP
View on GitHub
☆27Apr 21, 2021Updated 5 years ago
funcwj / aps
View on GitHub
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
☆146Jul 6, 2023Updated 3 years ago
fgnt / paderbox
View on GitHub
Paderbox: A collection of utilities for audio / speech processing
☆43Jul 21, 2025Updated last year
Wendison / FCL-taco2
View on GitHub
Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021
☆41Jul 17, 2021Updated 5 years ago
RuABraun / texterrors
View on GitHub
☆37Jun 9, 2026Updated last month
PyThaiNLP / thai-g2p-wiktionary-corpus
View on GitHub
Thai Grapheme to Phoneme (G2P) Wiktionary Corpus
☆13Jul 25, 2022Updated 3 years ago
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
coryshain / dnnseg
View on GitHub
☆11Mar 20, 2021Updated 5 years ago
MichaelLLi / Text_Normalization
View on GitHub
A text normalization framework using GBM and human-generated features
☆10Feb 4, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
seungheondoh / hi_kia
View on GitHub
wake-up word emotion recognition [APSIPA 2022]
☆17Nov 11, 2022Updated 3 years ago
Hayeonbang / PIAST
View on GitHub
A piano music dataset with Audio, Symbolic and Text labels
☆36Mar 6, 2025Updated last year
hecko-yes / tts-dataset-prompts
View on GitHub
Finally, some decent sample sentences
☆24Dec 3, 2023Updated 2 years ago
yucongzh / online_speaker_diarization
View on GitHub
☆15Jul 11, 2022Updated 4 years ago
mathigatti / MellotronCPU
View on GitHub
Mellotron singing synthesizer using CPU
☆13Mar 24, 2023Updated 3 years ago
dan-wells / kiss-aligner
View on GitHub
Simple Kaldi recipe for forced alignment
☆11Jul 16, 2023Updated 3 years ago
jgarciapueyo / MelNet-SpeechGeneration
View on GitHub
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
☆25Sep 16, 2020Updated 5 years ago