uberduck-ai / dataset_viewer
Streamlit app to visualize and edit TTS datasets
☆14 Updated 2 years ago
Related projects
Alternatives and complementary repositories for dataset_viewer
- TTS Client for Coqui TTS server ☆13 Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆24 Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ☆22 Updated 3 months ago
- Simple PyTorch Denoisers for Waveform Audio ☆32 Updated last month
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP] ☆24 Updated 2 years ago
- Finally, some decent sample sentences ☆22 Updated 11 months ago
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion. ☆30 Updated last year
- Real-time end-to-end singing voice conversion ☆18 Updated 2 weeks ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models ☆65 Updated 2 years ago
- Generate accompaniment part with chords using Evolutionary algorithm. ☆8 Updated 2 years ago
- AudioLDM text to audio colab ☆19 Updated last year
- [DEPRECATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI notes encoding, true full MIDI instrume… ☆15 Updated 11 months ago
- [DEPRECATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti… ☆31 Updated 11 months ago
- BEGANSing - Korean SVS + SVC + AudioSR ☆12 Updated 9 months ago
- 'Grad-TTS' with Multilingual Cleaners ☆10 Updated 7 months ago
- Adaptive Vocoder for Custom Voice ☆58 Updated 2 years ago
- UTAUTAI (Unrestricted Tune Automated Technology Artificial Interigence) ☆11 Updated last year
- Heteronym to Phoneme Parser ☆15 Updated last year
- The implementation of "Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model" ☆14 Updated 2 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆46 Updated last year
- Easily turn large sets of audio URLs into an audio dataset. ☆20 Updated last year
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- Monkey Island fine-tune of Stable Diffusion ☆10 Updated last year
- SOTA Piano Transformer model trained on 4.2GB of Solo Piano MIDI music ☆21 Updated last year
- Tools to isolate speaker and transcribe unstructured audio clips ☆11 Updated last year
- AudioSR-Upsampling (any -> 48kHz) ☆38 Updated 9 months ago
- List of repositories relevant to VITS. ☆35 Updated last year