zakuro-ai / asr
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
☆68Updated last year
Related projects: ⓘ
- context labels and pronunciation data for JSUT corpus☆64Updated 3 years ago
- ☆83Updated 3 years ago
- ☆210Updated 10 months ago
- A public domain single speaker Japanese speech dataset☆34Updated 10 months ago
- One-button-press forced aligner for Japanese, using Julius.☆43Updated last year
- Official implementation of the source-filter HiFiGAN vocoder☆233Updated last year
- Python wrapper for OpenJTalk☆195Updated 2 months ago
- Onnx wrapper for espnet infrernce model☆152Updated 2 months ago
- ESPnet Model Zoo☆242Updated last year
- HTS-style full-context labels for JSUT v1.1☆45Updated 3 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆79Updated last year
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆105Updated 2 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆26Updated 5 months ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆69Updated 3 years ago
- ☆31Updated last year
- Singing Voice Synthesis based on VITS, different from VISinger☆182Updated 10 months ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆317Updated last month
- Speech Segmentation Toolkit using Julius☆17Updated 3 years ago
- This repository is a collection of TTS Models in TFLite☆186Updated 3 years ago
- Collection of pretrained models for the Montreal Forced Aligner☆108Updated 2 months ago
- An advance kaldi wrapper for Pyhton☆38Updated 3 years ago
- ☆57Updated 2 weeks ago
- Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.☆143Updated 2 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆217Updated last month
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆95Updated last year
- CURRENNNT codes and scripts☆77Updated 4 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆223Updated 2 years ago
- pytorch implementation of DNN-HSMM for TTS☆60Updated 3 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 3 years ago
- Tacotron2 + LPCNET for complete End-to-End TTS System☆93Updated last year