kaiidams/Kokoro-Speech-Dataset

A public-domain, single-speaker Japanese speech dataset

☆68

Alternatives and similar repositories for Kokoro-Speech-Dataset

Users interested in Kokoro-Speech-Dataset are comparing it to the repositories listed below.

sarulab-speech / jsut-label
Context labels and pronunciation data for the JSUT corpus
☆77 · Sep 2, 2021 · Updated 4 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
Project for "Hiding Speaker's Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline"
☆15 · Nov 30, 2022 · Updated 3 years ago
6gsn / marine
☆38 · Sep 20, 2022 · Updated 3 years ago
hlt-mt / Speech-MASSIVE
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆25 · Oct 8, 2025 · Updated 9 months ago
lingjzhu / spoken_sent_embedding
Unsupervised spoken sentence embeddings
☆14 · Dec 14, 2022 · Updated 3 years ago
maum-ai / sane-tts
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
☆11 · Jun 30, 2023 · Updated 3 years ago
JRMeyer / jphones
A Python 3 program for converting Japanese words and numbers into phonemes.
☆18 · Apr 24, 2018 · Updated 8 years ago
exercise-book-yq / Supercodec
☆51 · Mar 5, 2026 · Updated 4 months ago
PalabraAI / redimnet2
This repository contains the official implementation and pretrained weights for the paper "ReDimNet2: Scaling Speaker Verification via Ti…
☆65 · Jul 9, 2026 · Updated 2 weeks ago
kamperh / globalphone_awe
Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.
☆11 · Nov 3, 2020 · Updated 5 years ago
xinshengwang / robpitch
A pitch detection model trained to be robust in noisy and reverberant environments.
☆27 · Jan 21, 2025 · Updated last year
amanoese / kana2ipa
A command that converts hiragana or katakana into the phonetic symbols (IPA) used when pronouncing them in Japanese.
☆19 · Jan 5, 2023 · Updated 3 years ago
RF5 / transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models.
☆80 · Sep 27, 2023 · Updated 2 years ago
litagin02 / anime_speaker_embedding
Speaker embedding for the anime speech domain, based on ECAPA_TDNN
☆21 · Jun 22, 2025 · Updated last year
tennmoku71 / advent_calendar_cyberagent_llm_dialogue_system
☆11 · Jan 10, 2024 · Updated 2 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with PyTorch [kaldi]
☆12 · Jun 5, 2020 · Updated 6 years ago
keonlee9420 / Comprehensive-E2E-TTS
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…
☆147 · Jun 6, 2022 · Updated 4 years ago
laksjdjf / pfg
☆20 · Mar 28, 2023 · Updated 3 years ago
Hannes1 / react-native-wenet
WeNet speech-to-text for React Native
☆10 · Nov 1, 2022 · Updated 3 years ago
ydqmkkx / Respiro-en
Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…
☆44 · Sep 18, 2024 · Updated last year
liduojia1 / MeanFlowSE
☆43 · Jan 26, 2026 · Updated 5 months ago
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆29 · Jun 28, 2024 · Updated 2 years ago
jzmzhong / Automatic-Prosody-Annotator-with-SSWP-CLAP
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
☆50 · Jun 11, 2024 · Updated 2 years ago
seongmin-mun / KoG2Padvanced
☆21 · Jul 16, 2023 · Updated 3 years ago
sil-ai / tts-singlish
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.
☆11 · Jan 11, 2020 · Updated 6 years ago
inspection-ai / japanese-toxic-dataset
☆22 · Jan 11, 2023 · Updated 3 years ago
Deepest-Project / AlignTTS
Implementation of AlignTTS
☆77 · Jul 6, 2023 · Updated 3 years ago
hotchpotch / fast-bunkai
⚡ Japanese sentence splitting (日本語文境界判定器, a Japanese sentence-boundary detector), 40–250× faster via a Rust-accelerated Python library with near-perfect API compatibility with …
☆75 · Oct 14, 2025 · Updated 9 months ago
lingjzhu / CharsiuG2P
Multilingual G2P in 100 languages
☆390 · May 26, 2023 · Updated 3 years ago
Deep-unlearning / Llasa-GRPO
☆18 · Nov 19, 2025 · Updated 8 months ago
mt-upc / ZeroSwot
Pushing the Limits of Zero-shot End-to-End Speech Translation
☆25 · Dec 12, 2024 · Updated last year
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 · Apr 13, 2022 · Updated 4 years ago
Adibian / Persian-MultiSpeaker-Tacotron2
Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) for the Persian language.
☆13 · Oct 2, 2025 · Updated 9 months ago
FantSun / Speechflow
Speechflow for emotion recognition related information decomposition
☆10 · Jul 27, 2021 · Updated 4 years ago
jasonppy / FaST-VGS-Family
Transformer-based visually grounded speech models
☆19 · Sep 22, 2022 · Updated 3 years ago
Wataru-Nakata / latentlm-tts
☆29 · Jul 3, 2026 · Updated 3 weeks ago
nii-yamagishilab / ZMM-TTS
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
☆185 · Mar 6, 2024 · Updated 2 years ago
r9y9 / pyopenjtalk
Python wrapper for OpenJTalk
☆255 · Apr 8, 2025 · Updated last year
TTS-Research / PEL-TTS
☆14 · Aug 16, 2023 · Updated 2 years ago