A visual tool for quickly building speech datasets
☆198Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for sound_dataset_tools2
Users that are interested in sound_dataset_tools2 are comparing it to the libraries listed below.
- A tool for building and labeling speech datasets☆138Oct 30, 2022Updated 3 years ago
- Scripts for automated dataset creation☆72Mar 26, 2023Updated 3 years ago
- An auxiliary tool for manual screening of audio dataset.☆132Jun 23, 2023Updated 2 years ago
- Copied from official repo of VITS. Added some comments.☆19Sep 24, 2024Updated last year
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆17Apr 13, 2023Updated 2 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆281Jul 16, 2023Updated 2 years ago
- A voiceprint recognition classifier for audio dataset☆105Jun 21, 2023Updated 2 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!☆1,227Feb 5, 2024Updated 2 years ago
- Fine-Tuning your VITS model using a pre-trained model☆551May 2, 2023Updated 2 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆417Nov 20, 2025Updated 4 months ago
- GPT for FACodec☆13Mar 25, 2024Updated 2 years ago
- VITS for Mandarin. Supports Windows and Linux, on both low-end and high-end hardware☆111May 6, 2023Updated 2 years ago
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆5,026Jan 21, 2025Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 3 months ago
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆524Dec 28, 2023Updated 2 years ago
- Unofficial implementation of Megatts2☆288Mar 23, 2024Updated 2 years ago
- An emotion-controllable speech synthesis model based on VITS that requires no emotion labels☆1,397Mar 30, 2023Updated 2 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Oct 23, 2024Updated last year
- A C++ inference library for multiple SVC/TTS models☆1,121May 18, 2025Updated 10 months ago
- PITS for Chinese, Japanese, English, and Korean☆12Mar 14, 2023Updated 3 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
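- Continues training on the Biaobei (Databaker) dataset and improves the original FastSpeech2 model by adding prosody representation and prosody prediction modules, making Chinese pronunciation more vivid and rhythmic☆278Sep 10, 2023Updated 2 years ago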
- ☆12Nov 7, 2024Updated last year
- VITS Japanese with Whisper as the data processor (you can train your VITS even if you only have audio)☆162May 7, 2023Updated 2 years ago
- An auxiliary tool for manual screening of audio dataset.☆20Jul 18, 2023Updated 2 years ago
- ☆71Jul 13, 2023Updated 2 years ago
- a lightweight voice conversion☆86Feb 25, 2026Updated last month
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆237Feb 29, 2024Updated 2 years ago
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai☆941Dec 6, 2023Updated 2 years ago
- A PyTorch-based VITS-BigVGAN Chinese TTS model with an added prosody prediction model.☆197Sep 15, 2022Updated 3 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- An open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx☆15Dec 16, 2023Updated 2 years ago
- Train the next generation of TTS systems.☆171Sep 13, 2024Updated last year
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆48Nov 23, 2022Updated 3 years ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆53Jun 2, 2020Updated 5 years ago
- An improved version of the PaddleSpeech TTS Android Demo that implements inference for a mixed Chinese-English model and a mixed Chinese-English C++ frontend☆47Mar 21, 2023Updated 3 years ago
- ☆23Oct 17, 2024Updated last year