karayakar / OrpheusTTS-ParquetDatasetCreator
This app creates or reads Parquet datasets.
☆28 · Updated 4 months ago
Alternatives and similar repositories for OrpheusTTS-ParquetDatasetCreator
Users who are interested in OrpheusTTS-ParquetDatasetCreator are comparing it to the libraries listed below.
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆120 · Updated last month
- ☆275 · Updated last month
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ☆185 · Updated 11 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io ☆69 · Updated last year
- Diffusion Singing Voice Conversion based on Grad-TTS from Huawei ☆155 · Updated last year
- Text-audio foundation model from Boson AI ☆90 · Updated 2 weeks ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction ☆133 · Updated 4 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing. ☆251 · Updated last year
- Real-time Speech-Text Foundation Model Toolkit (wip) ☆243 · Updated 5 months ago
- Fine-tune the LLM part of the Spark-TTS model ☆106 · Updated 5 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ☆78 · Updated 11 months ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆297 · Updated last month
- ☆167 · Updated 8 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization" ☆115 · Updated 3 months ago
- Text-to-speech using an autoregressive transformer and VITS ☆243 · Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability ☆102 · Updated 7 months ago
- F5-TTS inference acceleration, roughly 4x faster! ☆108 · Updated 7 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine-tune or train a voice model. ☆42 · Updated last month
- A Frontier Japanese Speech Generation net ☆52 · Updated 3 months ago
- Barkify: an unofficial training implementation of Bark TTS by suno-ai ☆127 · Updated 2 years ago
- ☆248 · Updated 2 months ago
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. ☆85 · Updated 9 months ago
- ☆147 · Updated 6 months ago
- ☆109 · Updated last week
- Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models ☆179 · Updated last week
- Voice gender classifier using ECAPA-TDNN ☆56 · Updated 7 months ago
- Train the next generation of TTS systems. ☆167 · Updated 11 months ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud… ☆103 · Updated 8 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch ☆129 · Updated 9 months ago
- ☆307 · Updated 4 months ago