codemandosch / taco2swe
A modification of https://github.com/Rayhane-mamah/Tacotron-2 that is intended for use with the Swedish language.
☆10 · Updated 6 years ago
Related projects
Alternatives and complementary repositories for taco2swe
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices ☆16 · Updated 8 months ago
- ☆16 · Updated 3 years ago
- Multilingual Grapheme to Phoneme ☆49 · Updated 8 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆15 · Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone … ☆41 · Updated 2 years ago
- Deep understanding and modelling of the hierarchical structure of prosody ☆22 · Updated 5 years ago
- This is now the official location of the Kaldi project. ☆13 · Updated 5 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 · Updated last year
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆14 · Updated 4 years ago
- ☆17 · Updated last year
- Simple text to phonemes converter for multiple languages ☆20 · Updated 2 years ago
- PAVOQUE Corpus of Expressive Speech ☆12 · Updated 8 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference ☆30 · Updated 4 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper. ☆15 · Updated 3 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆37 · Updated 2 years ago
- A collection of utilities for handling IPA phones. ☆25 · Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆100 · Updated last year
- A handy dataset of noises for ASR ☆19 · Updated 5 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder ☆64 · Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆12 · Updated 3 years ago
- tensorflow speech synthesis c++ inference for voicenet ☆16 · Updated 5 years ago
- ☆22 · Updated 3 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge ☆14 · Updated 2 years ago
- Phonetically-Oriented Word Error Rate ☆33 · Updated 5 years ago
- ☆77 · Updated 6 months ago
- MultiSpeaker Tacotron2 using LifeLong Learning. ☆13 · Updated 5 years ago
- Forced Alignments for Common Voice ☆31 · Updated 4 years ago
- Coqui Inference Engine ☆38 · Updated 3 years ago