zakuro-ai / asrLinks
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
☆69Updated 3 years ago
Alternatives and similar repositories for asr
Users that are interested in asr are comparing it to the libraries listed below
Sorting:
- context labels and pronunciation data for JSUT corpus☆76Updated 4 years ago
- ESPnet Model Zoo☆256Updated 2 years ago
- Onnx wrapper for espnet infrernce model☆169Updated 3 months ago
- ☆89Updated 4 years ago
- ☆227Updated 2 years ago
- Python wrapper for OpenJTalk☆240Updated 7 months ago
- One-button-press forced aligner for Japanese, using Julius.☆47Updated 2 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Updated 3 years ago
- This repository is a collection of TTS Models in TFLite☆201Updated 4 years ago
- Voice Conversion Tool Kit☆606Updated 2 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆265Updated last year
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆112Updated 3 years ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆191Updated 2 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆232Updated 3 years ago
- real time japanese speech recognition translator using wav2vec2☆39Updated 3 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Updated 3 years ago
- ☆32Updated 2 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆264Updated 2 years ago
- [WIP] Scripts for fine-tuning Whisper☆223Updated 2 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 4 years ago
- Library to build speech synthesis systems designed for easy and fast prototyping.☆399Updated last year
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆162Updated 4 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆162Updated this week
- Tacotron2 + LPCNET for complete End-to-End TTS System☆93Updated 2 years ago
- A public domain single speaker Japanese speech dataset☆63Updated 2 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…☆311Updated 4 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆218Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- A fork of open_jtalk☆67Updated 8 months ago
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆260Updated 6 years ago