leavelet / singing-database-maker
AI based singing voice synthesis database generator
☆12Updated 2 years ago
Alternatives and similar repositories for singing-database-maker
Users that are interested in singing-database-maker are comparing it to the libraries listed below
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Updated last year
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- ☆21Updated 8 months ago
- A minimum inference engine for DiffSinger☆34Updated last year
- Use VITS and Opencpop to develop singing voice synthesis; different from VISinger.☆35Updated 2 years ago
- Lyrics-to-audio alignment system. Initially done using HTK for rapid prototyping☆14Updated 7 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 9 months ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)☆22Updated 3 years ago
- ☆13Updated last year
- Project of Singing Voice Conversion.☆14Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the ID to implement voice cloning.☆30Updated 2 years ago
- BEGANSing - Korean SVS + SVC + AudioSR☆11Updated last year
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆27Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Updated 2 months ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 10 months ago
- ☆41Updated 2 years ago
- trying to reproduce suno v3☆33Updated 5 months ago
- noise reduction☆17Updated 11 months ago
- Generate accompaniment part with chords using Evolutionary algorithm.☆9Updated 3 years ago
- ☆10Updated 7 months ago
- Multispeaker Community Vocoder Model for DiffSinger☆37Updated last month
- AudioSR-Upsampling (any -> 48kHz)☆41Updated last year
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated last year
- semantic tokenizer for speech and music☆17Updated this week
- ☆8Updated 10 months ago
- PodcastMix: A dataset for separating music and speech in podcasts.☆43Updated 10 months ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆26Updated last month
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated last year