reazon-research/ReazonSpeech

reazon-research / ReazonSpeech

Massive open Japanese speech corpus

☆389

Alternatives and similar repositories for ReazonSpeech

Users who are interested in ReazonSpeech are comparing it to the libraries listed below.

ductuantruong / speaker_age_estimation_ssl_study
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14 · Oct 19, 2022 · Updated 3 years ago
6gsn / marine
☆38 · Sep 20, 2022 · Updated 3 years ago
sarulab-speech / jsut-label
context labels and pronunciation data for JSUT corpus
☆77 · Sep 2, 2021 · Updated 4 years ago
yangdongchao / SimpleSpeech
The open source code for SimpleSpeech series
☆147 · Oct 8, 2024 · Updated last year
PKSHATechnology-Research / tdmelodic
A Japanese accent dictionary generator
☆126 · Mar 21, 2024 · Updated 2 years ago
MiniXC / LightningFastSpeech2
☆55 · Jan 13, 2023 · Updated 3 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15 · Nov 30, 2022 · Updated 3 years ago
sarulab-speech / tdmelodic_openjtalk
tdmelodic for open-jtalk
☆26 · Aug 30, 2021 · Updated 4 years ago
csukuangfj / kaldi-hmm-gmm
☆28 · Apr 24, 2026 · Updated 2 months ago
laboroai / LaboroTVSpeech
☆90 · Mar 5, 2021 · Updated 5 years ago
wenet-e2e / WeSpeech-AI
Open Source Speech/Text Data on AI
☆19 · Sep 13, 2022 · Updated 3 years ago
line / LibriTTS-P
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
☆161 · Jun 13, 2024 · Updated 2 years ago
sp-nitech / diffsptk
A differentiable version of SPTK
☆201 · Jul 14, 2026 · Updated last week
laboroai / TEDxJP-10K
☆26 · Jan 14, 2021 · Updated 5 years ago
yl4579 / PitchExtractor
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
☆151 · Aug 22, 2022 · Updated 3 years ago
anonymous-pits / pits
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆280 · Jul 16, 2023 · Updated 3 years ago
hhguo / WaveRNN
Based on https://github.com/fatchord/WaveRNN
☆24 · May 3, 2020 · Updated 6 years ago
hhguo / SoCodec
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆92 · Dec 20, 2024 · Updated last year
sarulab-speech / Coco-Nut
Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus
☆21 · Jun 12, 2024 · Updated 2 years ago
sp-nitech / SPTK
A suite of speech signal processing tools
☆246 · Updated this week
Mddct / transformer-vocos
☆35 · Sep 6, 2025 · Updated 10 months ago
xinjli / alqalign
multilingual speech aligner
☆78 · Nov 19, 2023 · Updated 2 years ago
MaAI-Kyoto / MaAI
A real-time software for turn-taking, backchannel, and head-nodding prediction
☆101 · Updated this week
Takaaki-Saeki / DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
☆185 · Dec 5, 2024 · Updated last year
Wataru-Nakata / miipher
Unofficial implementation of miipher
☆137 · Apr 19, 2024 · Updated 2 years ago
csukuangfj / kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…
☆215 · Jul 10, 2026 · Updated last week
LAION-AI / scaled-echo-tts
Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024…
☆24 · Mar 29, 2026 · Updated 3 months ago
kotoba-tech / kotoba-whisper
☆97 · Oct 23, 2024 · Updated last year
keonlee9420 / DailyTalk
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
☆259 · Jun 5, 2025 · Updated last year
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 · Apr 13, 2022 · Updated 4 years ago
darashi / narabas
narabas: Japanese phoneme forced alignment tool
☆15 · Mar 15, 2023 · Updated 3 years ago
tsukumijima / pyopenjtalk-plus
pyopenjtalk-plus: A Python wrapper for OpenJTalk with additional improvements
☆57 · Updated this week
jumon / whisper-punctuator
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆120 · Feb 4, 2023 · Updated 3 years ago
SonyResearch / VRVQ
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54 · May 1, 2025 · Updated last year
k2-fsa / libriheavy
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
☆220 · Sep 10, 2024 · Updated last year
NVIDIA / radtts
Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …
☆291 · Apr 6, 2023 · Updated 3 years ago
projectlucas / efficient_whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆19 · Dec 1, 2022 · Updated 3 years ago
imdanboy / jets
JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech
☆112 · Jun 6, 2022 · Updated 4 years ago
sarulab-speech / audio-foundation-model-dataset
☆65 · Jan 8, 2025 · Updated last year