zakuro-ai / asrLinks
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
☆68Updated 2 years ago
Alternatives and similar repositories for asr
Users that are interested in asr are comparing it to the libraries listed below
Sorting:
- context labels and pronunciation data for JSUT corpus☆70Updated 3 years ago
- ☆221Updated last year
- One-button-press forced aligner for Japanese, using Julius.☆44Updated last year
- Python wrapper for OpenJTalk☆223Updated 2 months ago
- ☆87Updated 4 years ago
- Onnx wrapper for espnet infrernce model☆162Updated 8 months ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆110Updated 3 years ago
- A fork of open_jtalk☆58Updated 2 months ago
- ☆20Updated 4 years ago
- HTS-style full-context labels for JSUT v1.1☆47Updated 4 years ago
- ESPnet Model Zoo☆251Updated last year
- Tacotron2 + LPCNET for complete End-to-End TTS System☆93Updated last year
- ☆32Updated 2 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆172Updated 2 years ago
- ☆34Updated 2 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆194Updated 2 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆28Updated last year
- xvector model on jtubespeech☆45Updated last year
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆125Updated 3 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆253Updated last year
- Singing Voice Synthesis based on VITS, different from VISinger☆190Updated last year
- Neural HMMs are all you need (for high-quality attention-free TTS)☆158Updated last week
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆96Updated 2 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Updated 4 years ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆214Updated last year
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 3 years ago
- ☆152Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆249Updated 10 months ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset☆348Updated 3 years ago
- Efficient neural speech synthesis☆80Updated 4 years ago