uberduck-ai / dataset_viewer
Streamlit app to visualize and edit TTS datasets
☆14Updated 3 years ago
Alternatives and similar repositories for dataset_viewer:
Users that are interested in dataset_viewer are comparing it to the libraries listed below
- Finally, some decent sample sentences☆22Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- A simple voice conversion tool☆17Updated 2 years ago
- singing voice conversion based on glow-tts☆11Updated last year
- List of repositories relevant to VITS.☆36Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆22Updated 7 months ago
- Heteronym to Phoneme Parser☆18Updated last year
- Non Parallel Voice Conversion based on VITS☆24Updated last year
- ☆28Updated last year
- ☆10Updated 3 months ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated 10 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Open Source Text-to-Speech GUI Tool running on TalkNet☆11Updated 2 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated 2 years ago
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆73Updated last year
- Text prompt steered synthetic audio generators☆45Updated last year
- Collect Voice Conversion researches☆91Updated last week
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆29Updated 2 years ago
- MFA acoustic model training based on Opencpop☆14Updated 2 years ago
- text-to-audio-latent-diffusion☆37Updated last year
- Zero-Shot Emotion Style Transfer☆41Updated 10 months ago
- StyleTTS 2 Optimized Training Fork☆24Updated last month
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- ☆13Updated last year
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆22Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Updated last year