zakuro-ai / asrLinks
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
☆68Updated 2 years ago
Alternatives and similar repositories for asr
Users that are interested in asr are comparing it to the libraries listed below
Sorting:
- Python wrapper for OpenJTalk☆225Updated 3 months ago
- context labels and pronunciation data for JSUT corpus☆71Updated 3 years ago
- ☆222Updated last year
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆172Updated 2 years ago
- One-button-press forced aligner for Japanese, using Julius.☆46Updated 2 years ago
- ☆87Updated 4 years ago
- real time japanese speech recognition translator using wav2vec2☆39Updated 2 years ago
- ESPnet Model Zoo☆255Updated 2 years ago
- ☆27Updated 4 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆251Updated 11 months ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆125Updated 3 years ago
- ☆34Updated 2 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆320Updated 11 months ago
- Onnx wrapper for espnet infrernce model☆166Updated 9 months ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆110Updated 3 years ago
- ☆32Updated 2 years ago
- Speech Segmentation Toolkit using Julius☆18Updated 3 years ago
- ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)☆260Updated 2 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 3 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆229Updated 3 years ago
- A public domain single speaker Japanese speech dataset☆54Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆84Updated 2 years ago
- A fork of open_jtalk☆60Updated 3 months ago
- xvector model on jtubespeech☆45Updated last year
- Official implementation of the source-filter HiFiGAN vocoder☆255Updated last year
- Singing Voice Synthesis based on VITS, different from VISinger☆189Updated last year
- HTS-style full-context labels for JSUT v1.1☆47Updated 4 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆194Updated 2 years ago
- Voice Conversion by CycleGAN (语音克隆/语音转换):CycleGAN-VC3☆150Updated 3 years ago
- Text to Speech with PyTorch (English and Mongolian)☆185Updated 9 months ago