zakuro-ai / asr
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
☆68Updated 2 years ago
Alternatives and similar repositories for asr:
Users that are interested in asr are comparing it to the libraries listed below
- context labels and pronunciation data for JSUT corpus☆68Updated 3 years ago
- Python wrapper for OpenJTalk☆215Updated this week
- ☆86Updated 4 years ago
- ☆217Updated last year
- ☆18Updated 4 years ago
- ☆32Updated 2 years ago
- One-button-press forced aligner for Japanese, using Julius.☆44Updated last year
- Onnx wrapper for espnet infrernce model☆161Updated 5 months ago
- ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)☆258Updated 2 years ago
- ☆33Updated 2 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆249Updated last year
- real time japanese speech recognition translator using wav2vec2☆37Updated 2 years ago
- 44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。☆14Updated last year
- ESPnet Model Zoo☆248Updated last year
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆110Updated 2 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆27Updated last year
- HTS-style full-context labels for JSUT v1.1☆46Updated 3 years ago
- A fork of open_jtalk☆57Updated this week
- 44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。☆23Updated last year
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆54Updated last year
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 3 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Updated 2 years ago
- Japanese dictation kit using Julius☆157Updated 5 years ago
- 44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。☆19Updated last year
- JVS (Japanese versatile speech) コーパスの自作のラベル☆31Updated 3 years ago
- Massive open Japanese speech corpus☆278Updated last week
- Speech Segmentation Toolkit using Julius☆18Updated 3 years ago
- An advance kaldi wrapper for Pyhton☆38Updated 4 years ago
- convert .lab files to .TextGrid files, which can be used in Praat☆14Updated 6 years ago
- Nue-ASR inference code by rinna Co., Ltd.☆32Updated 8 months ago