zakuro-ai / asr
ASRDeepspeech x Sakura-ML (English/Japanese): a DeepSpeech2 model in PyTorch, with support from Zakuro AI.
☆69Updated 3 years ago
Alternatives and similar repositories for asr
Users who are interested in asr are comparing it to the libraries listed below.
- ☆229Updated 2 years ago
- context labels and pronunciation data for JSUT corpus☆77Updated 4 years ago
- ESPnet Model Zoo☆259Updated 2 years ago
- Python wrapper for OpenJTalk☆241Updated 9 months ago
- ONNX wrapper for espnet inference model☆168Updated 5 months ago
- ☆88Updated 4 years ago
- One-button-press forced aligner for Japanese, using Julius.☆47Updated 2 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Updated 3 years ago
- ☆36Updated 3 years ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion, with support for 44100 Hz Japanese HuBERT.☆15Updated 2 years ago
- xvector model on jtubespeech☆46Updated 2 years ago
- This repository is a collection of TTS Models in TFLite☆201Updated 4 years ago
- HTS-style full-context labels for JSUT v1.1☆50Updated 4 years ago
- A fork of open_jtalk☆69Updated 9 months ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Updated 3 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆266Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor, with support for 44100 Hz Japanese audio.☆20Updated 2 years ago
- ☆32Updated 3 years ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆113Updated 3 years ago
- [WIP] Scripts for fine-tuning Whisper☆222Updated 2 years ago
- A tool for aligning phoneme labels to Japanese speech.☆36Updated 5 months ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆265Updated last year
- ☆77Updated last year
- This repository contains the scripts to use CURRENNT☆66Updated 5 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io☆69Updated 2 years ago
- Tacotron2 + LPCNET for complete End-to-End TTS System☆93Updated 2 years ago
- A public domain single speaker Japanese speech dataset☆63Updated 2 years ago
- ☆24Updated 5 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆232Updated 3 years ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion☆258Updated 2 years ago