zakuro-ai / asrLinks
ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
☆69Updated 3 years ago
Alternatives and similar repositories for asr
Users that are interested in asr are comparing it to the libraries listed below
Sorting:
- ☆228Updated 2 years ago
- context labels and pronunciation data for JSUT corpus☆77Updated 4 years ago
- ESPnet Model Zoo☆258Updated 2 years ago
- ☆89Updated 4 years ago
- Python wrapper for OpenJTalk☆241Updated 8 months ago
- One-button-press forced aligner for Japanese, using Julius.☆47Updated 2 years ago
- Deep neural network (DNN) for noise reduction, removal of background music, and speech separation☆173Updated 3 years ago
- [WIP] Scripts for fine-tuning Whisper☆222Updated 2 years ago
- A public domain single speaker Japanese speech dataset☆63Updated 2 years ago
- Onnx wrapper for espnet infrernce model☆169Updated 4 months ago
- ☆27Updated 4 years ago
- A fork of open_jtalk☆68Updated 8 months ago
- ☆74Updated last year
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆218Updated 2 years ago
- ☆32Updated 3 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Updated 3 years ago
- ☆36Updated 3 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆265Updated last year
- Speech Segmentation Toolkit using Julius☆18Updated 4 years ago
- HTS-style full-context labels for JSUT v1.1☆50Updated 4 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆263Updated 2 years ago
- real time japanese speech recognition translator using wav2vec2☆39Updated 3 years ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆113Updated 3 years ago
- xvector model on jtubespeech☆46Updated 2 years ago
- Voice Conversion Tool Kit☆607Updated 2 years ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆232Updated 3 years ago
- Library to build speech synthesis systems designed for easy and fast prototyping.☆399Updated last year
- Massive open Japanese speech corpus☆344Updated 2 months ago
- Speech noise reduction which was generated using existing post-production techniques implemented in Python☆181Updated 4 years ago
- ☆204Updated 3 years ago