timmahrt / pyJuliusAlign
One-button-press forced aligner for Japanese, using Julius.
☆44Updated last year
Alternatives and similar repositories for pyJuliusAlign:
Users that are interested in pyJuliusAlign are comparing it to the libraries listed below
- context labels and pronunciation data for JSUT corpus☆69Updated 3 years ago
- ☆86Updated 4 years ago
- convert .lab files to .TextGrid files, which can be used in Praat☆14Updated 6 years ago
- Speech Segmentation Toolkit using Julius☆18Updated 3 years ago
- HTS-style full-context labels for JSUT v1.1☆47Updated 4 years ago
- ☆152Updated last year
- Official implementation of the source-filter HiFiGAN vocoder☆251Updated last year
- A differentiable version of SPTK☆182Updated this week
- ☆218Updated last year
- Python package implementing the TD-PSOLA algorithm for speech processing☆42Updated 7 years ago
- JVS (Japanese versatile speech) コーパスの自作のラベル☆31Updated 4 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"☆27Updated last year
- CURRENNNT codes and scripts☆77Updated 4 years ago
- SelfRemaster: SSL Speech Restoration☆88Updated last year
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆110Updated 2 years ago
- pytorch implementation of Neural Parametric Singing Synthesizer 歌声合成☆155Updated 3 years ago
- Yin pitch estimator in PyTorch☆114Updated 2 years ago
- A suite of speech signal processing tools☆232Updated last month
- Speech Segmentation Toolkit using Julius☆92Updated 5 years ago
- Fully-Convolutional Network for Pitch Estimation of Speech Signals☆56Updated 2 years ago
- 東北イタコ歌唱データベースの最新ラベルデータ☆20Updated 3 years ago
- A Toolkit for ToBI Labeling with Python Data Structures☆24Updated 2 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated 9 months ago
- MelGAN implementation with Multi-Band and Full Band supports...☆61Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆60Updated 2 years ago
- An 16kHz implementation of HiFi-GAN for soft-vc.☆98Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages☆135Updated last year
- ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.☆68Updated 2 years ago
- ☆96Updated last year
- ☆69Updated 4 years ago