uberduck-ai / dataset_viewerLinks
Streamlit app to visualize and edit TTS datasets
☆14Updated 3 years ago
Alternatives and similar repositories for dataset_viewer
Users that are interested in dataset_viewer are comparing it to the libraries listed below
Sorting:
- Finally, some decent sample sentences☆23Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 10 months ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 10 months ago
- Heteronym to Phoneme Parser☆18Updated last year
- A simple voice conversion tool☆17Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆26Updated 2 years ago
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Updated last year
- 'Grad-TTS' with Multilingual Cleaners☆10Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- Creates video from TTS output and viseme images.☆12Updated 3 years ago
- A collection of all our phonemeizers for dataset construction and inference☆23Updated 4 months ago
- List of repositories relevant to VITS.☆36Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer☆18Updated 3 months ago
- singing voice conversion based on glow-tts☆11Updated last year
- Interface for using TTS and vocoder models in the form of a text editor☆20Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- AudioLDM text to audio colab☆19Updated last year
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Simple PyTorch Denoisers for Waveform Audio☆35Updated 2 months ago
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model"☆15Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- ☆14Updated 2 years ago
- ☆29Updated last year
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- Non Parallel Voice Conversion based on VITS☆24Updated 2 years ago
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Updated 2 years ago
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC☆14Updated 2 months ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆60Updated 2 years ago