A visual tool for quickly building speech datasets
☆199Mar 7, 2024Updated 2 years ago
Alternatives and similar repositories for sound_dataset_tools2
Users who are interested in sound_dataset_tools2 are comparing it to the libraries listed below.
- A tool for building and labeling speech datasets☆138Oct 30, 2022Updated 3 years ago
- Scripts for automated dataset creation☆72Mar 26, 2023Updated 3 years ago
- An auxiliary tool for manual screening of audio dataset.☆132Jun 23, 2023Updated 2 years ago
- Copied from official repo of VITS. Added some comments.☆19Sep 24, 2024Updated last year
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆17Apr 13, 2023Updated 3 years ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor☆281Jul 16, 2023Updated 2 years ago
- A voiceprint recognition classifier for audio dataset☆105Jun 21, 2023Updated 2 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!☆1,230Feb 5, 2024Updated 2 years ago
- Fine-Tuning your VITS model using a pre-trained model☆551May 2, 2023Updated 2 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆416Nov 20, 2025Updated 4 months ago
- GPT for FACodec☆13Mar 25, 2024Updated 2 years ago
- VITS for Mandarin. Supports Windows and Linux, on both low-end and high-end hardware☆111May 6, 2023Updated 2 years ago
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆5,022Jan 21, 2025Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 4 months ago
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆524Dec 28, 2023Updated 2 years ago
- Unofficial implementation of Megatts2☆285Mar 23, 2024Updated 2 years ago
- An emotion-controllable speech synthesis model based on VITS that requires no emotion annotations☆1,395Mar 30, 2023Updated 3 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆60Apr 4, 2024Updated 2 years ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Oct 23, 2024Updated last year
- A C++ inference library for multiple SVC/TTS models☆1,120May 18, 2025Updated 11 months ago
- PITS for Chinese, Japanese, English, and Korean☆12Mar 14, 2023Updated 3 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Jan 10, 2025Updated last year
- Continued training on the Biaobei (标贝) data, with improvements to the original FastSpeech2 model: prosody representation and prosody prediction modules are introduced to make Chinese pronunciation more vivid and rhythmic☆281Sep 10, 2023Updated 2 years ago
- ☆12Nov 7, 2024Updated last year
- An auxiliary tool for manual screening of audio dataset.☆20Jul 18, 2023Updated 2 years ago
- VITS Japanese with Whisper as data processor (you can train your VITS even if you only have audio)☆162May 7, 2023Updated 2 years ago
- ☆71Jul 13, 2023Updated 2 years ago
- a lightweight voice conversion☆86Feb 25, 2026Updated last month
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆237Feb 29, 2024Updated 2 years ago
- VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai☆943Dec 6, 2023Updated 2 years ago
- A PyTorch-based VITS-BigVGAN Chinese TTS model with an added prosody prediction model.☆197Sep 15, 2022Updated 3 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- An open-source toolkit for single and multi-modal speaker verification from modelscope and funasr with onnx☆15Dec 16, 2023Updated 2 years ago
- Train the next generation of TTS systems.☆170Sep 13, 2024Updated last year
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform☆48Nov 23, 2022Updated 3 years ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆53Jun 2, 2020Updated 5 years ago
- An improved PaddleSpeech TTS Android Demo that implements inference for a mixed Chinese-English model and a mixed Chinese-English C++ frontend☆47Mar 21, 2023Updated 3 years ago
- ☆23Oct 17, 2024Updated last year