Respaired/Tsukasa-Speech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Respaired/Tsukasa-Speech)

Respaired / Tsukasa-Speech

a Frontier Japanese Speech Generation net

☆65

Alternatives and similar repositories for Tsukasa-Speech

Users that are interested in Tsukasa-Speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ShoukanLabs / VoPho
View on GitHub
A collection of all our phonemeizers for dataset construction and inference
☆30Feb 21, 2025Updated last year
Deep-unlearning / Llasa-GRPO
View on GitHub
☆18Nov 19, 2025Updated 8 months ago
Stylish-TTS / stylish-tts
View on GitHub
High quality text-to-speech based on StyleTTS 2.
☆78Apr 6, 2026Updated 3 months ago
Parakeet-Inc / J-HARD-TTS-Eval
View on GitHub
☆21Jan 28, 2026Updated 6 months ago
IIEleven11 / Automatic-Audio-Dataset-Maker
View on GitHub
Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.
☆48Sep 15, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
smallbraineng / smalltts
View on GitHub
superfast text to speech in any voice
☆62Feb 16, 2026Updated 5 months ago
taresh18 / orpheus-streaming
View on GitHub
Orpheus TTS Server with streaming support (TTFB ~160ms)
☆26Sep 21, 2025Updated 10 months ago
tonnetonne814 / MB-iSTFT-VITS-44100-Ja
View on GitHub
44100Hz日本語音源に対応した MB-iSTFT-VITS: Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Tim…
☆39Jun 2, 2023Updated 3 years ago
Zhongxu-Wang / ArtSpeech
View on GitHub
ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations
☆22Sep 21, 2025Updated 10 months ago
duerig / StyleTTS2
View on GitHub
StyleTTS 2 Optimized Training Fork
☆32Feb 2, 2025Updated last year
IDEA-Emdoor-Lab / UniTTS
View on GitHub
A TTS Trained on Universal Audio.
☆41Jun 6, 2025Updated last year
AshwinSankar17 / intro-to-tts
View on GitHub
A notebooks based (soft) intro to modern TTS
☆18Jun 8, 2025Updated last year
sarulab-speech / Coco-Nut
View on GitHub
Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus
☆21Jun 12, 2024Updated 2 years ago
Zuntan03 / EasyLlasa
View on GitHub
EasyLlasa は 5～15秒の日本語音声と日本語テキストから日本語音声を生成する TSTS (TextSpeechToSpeech) です。
☆26Sep 29, 2025Updated 10 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
DwangoMediaVillage / pydomino
View on GitHub
日本語音声に対して音素ラベルをアラインメントするためのツールです
☆40Aug 19, 2025Updated 11 months ago
Plachtaa / ASTRAL-quantization
View on GitHub
speaker-disentangled speech linguistic content quantizer
☆26Mar 19, 2025Updated last year
innnky / MagVITS
View on GitHub
VITS with phoneme-level prosody modeling based on MaskGIT
☆85Aug 31, 2024Updated last year
tonnetonne814 / unofficial-vits2-44100-Ja
View on GitHub
44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。
☆24Sep 1, 2023Updated 2 years ago
litagin02 / anime_speaker_embedding
View on GitHub
Speaker embedding for anime speech domain based on ECAPA_TDNN
☆21Jun 22, 2025Updated last year
uthree / auris_experimental_vits_dsp
View on GitHub
AI based singing voice synthesis
☆37Jun 10, 2024Updated 2 years ago
YangXusheng-yxs / CodecFormer_5Hz
View on GitHub
☆35Oct 23, 2025Updated 9 months ago
tabahi / contexless-phonemes-CUPE
View on GitHub
pytorch model for contexless-phoneme prediction from speech audio
☆32Oct 30, 2025Updated 8 months ago
liuhuang31 / HiFTNet-sr
View on GitHub
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24Jan 2, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tonnetonne814 / SiFi-VITS2-44100-Ja
View on GitHub
DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.
☆55Sep 25, 2023Updated 2 years ago
syoyo / espnet-tts-streamlit
View on GitHub
ESPNet TTS with Streamlit GUI
☆14Apr 30, 2023Updated 3 years ago
jakariaemon / WSI
View on GitHub
Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.
☆26Jun 29, 2026Updated last month
line / promptttspp
View on GitHub
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
☆86Oct 11, 2024Updated last year
primepake / F5-TTS-meanflow-multilingual
View on GitHub
Meanflow and multilingual for F5-TTS model
☆16Aug 23, 2025Updated 11 months ago
nu-dialogue / moshi-finetune
View on GitHub
Fine-tuning Moshi/J-Moshi on your own spoken dialogue data
☆101Jan 5, 2026Updated 6 months ago
neodyland / sbv2-api
View on GitHub
Infer only tts
☆49Jul 13, 2026Updated 2 weeks ago
idiap / knn-tts
View on GitHub
Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model
☆36Apr 29, 2025Updated last year
yxlu-0102 / IDEA-TTS
View on GitHub
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis
☆27Mar 21, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
davidmartinrius / speech-dataset-generator
View on GitHub
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
☆262Jun 10, 2024Updated 2 years ago
ictnlp / SLED-TTS
View on GitHub
Streamable Text-to-Speech model using a language modeling approach, without vector quantization
☆108May 20, 2025Updated last year
lars76 / fastspeech2-clean
View on GitHub
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18Aug 16, 2024Updated last year
yl4579 / DMOSpeech2
View on GitHub
☆302Jul 22, 2025Updated last year
ozspeech / OZSpeech
View on GitHub
[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching
☆45Feb 9, 2025Updated last year
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
litagin02 / Aivis-Dataset
View on GitHub
💠 Aivis: AI Voice Imitation System
☆27Feb 25, 2024Updated 2 years ago