zakuro-ai / asr
ASRDeepspeech x Sakura-ML (English/Japanese): a DeepSpeech2 model in PyTorch, with support from Zakuro AI.
☆69Updated 2 years ago
Alternatives and similar repositories for asr
Users who are interested in asr are comparing it to the libraries listed below.
- Context labels and pronunciation data for the JSUT corpus☆73Updated 4 years ago
- One-button-press forced aligner for Japanese, using Julius.☆46Updated 2 years ago
- ☆225Updated last year
- Python wrapper for OpenJTalk☆229Updated 4 months ago
- ESPnet Model Zoo☆254Updated 2 years ago
- ☆87Updated 4 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆125Updated 3 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆172Updated 2 years ago
- ONNX wrapper for ESPnet inference model☆168Updated 3 weeks ago
- Official implementation of the source-filter HiFiGAN vocoder☆259Updated 2 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆254Updated last year
- Real-time Japanese speech recognition translator using wav2vec2☆39Updated 3 years ago
- ☆35Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor, adapted for 44100 Hz Japanese audio sources.☆20Updated 2 years ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion☆249Updated 2 years ago
- Speech Segmentation Toolkit using Julius☆18Updated 4 years ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion, adapted for a 44100 Hz Japanese HuBERT.☆15Updated 2 years ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆111Updated 3 years ago
- An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io☆69Updated last year
- A fork of open_jtalk☆61Updated 5 months ago
- Massive open Japanese speech corpus☆323Updated last week
- ☆28Updated 4 years ago
- Self-made labels for the JVS (Japanese versatile speech) corpus☆31Updated 4 years ago
- A public domain single speaker Japanese speech dataset☆55Updated last year
- x-vector model on JTubeSpeech☆45Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆84Updated 2 years ago
- ☆32Updated 2 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 4 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆229Updated 3 years ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆261Updated last month