JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit
☆44May 26, 2025Updated 9 months ago
Alternatives and similar repositories for jatts
Users that are interested in jatts are comparing it to the libraries listed below
Sorting:
- pyopenjtalk-plus: A Python wrapper for OpenJTalk with additional improvements☆56Nov 18, 2025Updated 3 months ago
- ☆10Oct 16, 2025Updated 4 months ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- Enhanced Piper TTS with Japanese support, WebAssembly, multi-GPU training, and quality improvements. Features OpenJTalk integration, brow…☆29Feb 22, 2026Updated last week
- ☆15Nov 10, 2025Updated 3 months ago
- ☆13Jul 10, 2021Updated 4 years ago
- ☆22Jul 30, 2025Updated 7 months ago
- A real-time and light-weight software for generation of non-linguistic behaviors (turn-taking, backchannel, and head-nodding) in conversa…☆81Feb 20, 2026Updated last week
- A toolkit dedicate for speech evaluation.☆23Sep 26, 2024Updated last year
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- ☆13Mar 11, 2025Updated 11 months ago
- PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.☆63Sep 8, 2025Updated 5 months ago
- DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.☆54Sep 25, 2023Updated 2 years ago
- ☆60Jan 8, 2025Updated last year
- 2024 Latest laughter detection & segmentaion model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe…☆62Sep 1, 2024Updated last year
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 8 months ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- Survey of audio language models☆62Feb 4, 2026Updated last month
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Aug 29, 2024Updated last year
- silero-vad pytorch implement☆35Nov 23, 2024Updated last year
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆87Dec 20, 2024Updated last year
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 6 months ago
- Open TTS models, built for streaming on the edge☆45Mar 16, 2025Updated 11 months ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆125Mar 20, 2025Updated 11 months ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Jun 5, 2023Updated 2 years ago
- Official release of StyleTalk dataset.☆72Jul 1, 2024Updated last year
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆36Feb 11, 2025Updated last year
- This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).☆37Mar 12, 2025Updated 11 months ago
- My vocoder experiments☆31Jul 26, 2025Updated 7 months ago
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…☆20Oct 11, 2024Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 7 months ago
- ☆36Sep 6, 2025Updated 5 months ago
- 日本語音声に対して音素ラベルをアラインメントするためのツールです☆36Aug 19, 2025Updated 6 months ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆61Jul 1, 2025Updated 8 months ago