zakuro-ai / asrLinks
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
☆68Updated 2 years ago
Alternatives and similar repositories for asr
Users that are interested in asr are comparing it to the libraries listed below
Sorting:
- context labels and pronunciation data for JSUT corpus☆74Updated 4 years ago
- ESPnet Model Zoo☆256Updated 2 years ago
- ☆226Updated last year
- One-button-press forced aligner for Japanese, using Julius.☆46Updated 2 years ago
- Python wrapper for OpenJTalk☆236Updated 6 months ago
- real time japanese speech recognition translator using wav2vec2☆39Updated 3 years ago
- ☆89Updated 4 years ago
- Onnx wrapper for espnet infrernce model☆169Updated 2 months ago
- ☆27Updated 4 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Updated 3 years ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆111Updated 3 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Updated 2 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 4 years ago
- ☆32Updated 2 years ago
- 44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。☆20Updated 2 years ago
- [WIP] Scripts for fine-tuning Whisper☆222Updated 2 years ago
- xvector model on jtubespeech☆45Updated last year
- Tacotron2 + LPCNET for complete End-to-End TTS System☆93Updated 2 years ago
- ☆35Updated 3 years ago
- This repository is a collection of TTS Models in TFLite☆199Updated 4 years ago
- Nue-ASR inference code by rinna Co., Ltd.☆35Updated last month
- 44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。☆15Updated 2 years ago
- A public domain single speaker Japanese speech dataset☆61Updated last year
- ☆67Updated 4 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆69Updated 2 years ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆162Updated 4 years ago
- ☆64Updated last year
- Non Parallel Voice Conversion based on VITS☆24Updated 2 years ago
- Application of MB-iSTFT-VITS components to vits2_pytorch☆130Updated 11 months ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Updated 5 years ago