kslz/sound_dataset_tools2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kslz/sound_dataset_tools2)

kslz / sound_dataset_tools2

一个快速制作语音数据集的可视化工具

☆200

Alternatives and similar repositories for sound_dataset_tools2

Users that are interested in sound_dataset_tools2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kslz / SoundLabel
View on GitHub
语音数据集制作标记工具
☆135Oct 30, 2022Updated 3 years ago
innnky / audio-preprocessing-scripts
View on GitHub
数据集自动化制作脚本
☆71Mar 26, 2023Updated 3 years ago
2DIPW / audio_dataset_screener
View on GitHub
An auxiliary tool for manual screening of audio dataset.
☆132Jun 23, 2023Updated 3 years ago
Movelocity / vits_for_chinese
View on GitHub
Copied from official repo of VITS. Added some comments.
☆19Sep 24, 2024Updated last year
PriesiaMioShirakana / Pits-Japanese-Onnx
View on GitHub
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆17Apr 13, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
anonymous-pits / pits
View on GitHub
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆280Jul 16, 2023Updated 3 years ago
2DIPW / audio_dataset_vpr
View on GitHub
A voiceprint recognition classifier for audio dataset
☆105Jun 21, 2023Updated 3 years ago
ex3ndr / supervoice-librilight-preprocessed
View on GitHub
60k hours of phoneme-aligned audio from audio books
☆19Jul 27, 2024Updated 2 years ago
PlayVoice / vits_chinese
View on GitHub
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
☆1,229Feb 5, 2024Updated 2 years ago
SayaSS / vits-finetuning
View on GitHub
Fine-Tuning your VITS model using a pre-trained model
☆544May 2, 2023Updated 3 years ago
wenet-e2e / wetts
View on GitHub
Production First and Production Ready End-to-End Text-to-Speech Toolkit
☆416Nov 20, 2025Updated 8 months ago
innnky / emotional-vits
View on GitHub
无需情感标注的情感可控语音合成模型，基于VITS
☆1,392Mar 30, 2023Updated 3 years ago
rotten-work / vits-mandarin-windows
View on GitHub
VITS for Mandarin. Support Windows and Linux, low-end and high-end hardwares
☆110May 6, 2023Updated 3 years ago
Plachtaa / VITS-fast-fine-tuning
View on GitHub
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
☆5,019Jan 21, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ex3ndr / supervoice-gpt-facodec
View on GitHub
GPT for FACodec
☆13Mar 25, 2024Updated 2 years ago
pengzhendong / g2p-mix
View on GitHub
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
☆115Updated this week
modelscope / KAN-TTS
View on GitHub
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…
☆525Dec 28, 2023Updated 2 years ago
LSimon95 / megatts2
View on GitHub
Unoffical implementation of Megatts2
☆285Mar 23, 2024Updated 2 years ago
0913ktg / SC_VALL-E
View on GitHub
Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E
☆136Oct 23, 2024Updated last year
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
View on GitHub
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆60Apr 4, 2024Updated 2 years ago
PriesiaMioShirakana / DragonianVoice
View on GitHub
多个SVC/TTS的C++推理库
☆1,128May 18, 2025Updated last year
innnky / pits
View on GitHub
PITS-中日英韩
☆12Mar 14, 2023Updated 3 years ago
ml-for-speech / speechtoolkit
View on GitHub
[Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…
☆21Jan 10, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Executedone / Chinese-FastSpeech2
View on GitHub
基于标贝数据继续训练，同时对原本的FastSpeech2模型做了改进，引入了韵律表征以及韵律预测模块，使中文发音更生动且富有节奏
☆277Sep 10, 2023Updated 2 years ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
uthree / tinyvc
View on GitHub
a lightweight voice conversion
☆87Feb 25, 2026Updated 5 months ago
AlexandaJerry / whisper-vits-japanese
View on GitHub
Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)
☆162May 7, 2023Updated 3 years ago
lifeiteng / SoundStorm
View on GitHub
☆71Jul 13, 2023Updated 3 years ago
adelacvg / NS2VC
View on GitHub
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
☆237Feb 29, 2024Updated 2 years ago
yuan1615 / AdaVocoder
View on GitHub
Adaptive Vocoder for Custom Voice
☆61Sep 22, 2022Updated 3 years ago
HuaWaterED / audio_dataset_screener
View on GitHub
An auxiliary tool for manual screening of audio dataset.
☆20Jul 18, 2023Updated 3 years ago
CjangCjengh / vits
View on GitHub
VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai
☆939Dec 6, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Zz-ww / VITS-BigVGAN-SpanPSP-Chinese
View on GitHub
基于PyTorch的VITS-BigVGAN的tts中文模型，加入韵律预测模型。
☆198Sep 15, 2022Updated 3 years ago
lovemefan / campplus
View on GitHub
A open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx
☆15Dec 16, 2023Updated 2 years ago
innnky / MB-iSTFT-VITS
View on GitHub
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
☆47Nov 23, 2022Updated 3 years ago
adelacvg / ttts
View on GitHub
Train the next generation of TTS systems.
☆169Sep 13, 2024Updated last year
pengzhendong / audio-pipeline
View on GitHub
☆23Oct 17, 2024Updated last year
Liu-Feng-deeplearning / TTS-frontend
View on GitHub
TTS-frontend with Bert and CRF/lstm (For Tacotron)
☆53Jun 2, 2020Updated 6 years ago
kakaobrain / g2pm
View on GitHub
A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
☆367Dec 24, 2021Updated 4 years ago