sony / diffusion-timbre-transfer
☆31 · Updated 2 months ago
Alternatives and similar repositories for diffusion-timbre-transfer:
Users who are interested in diffusion-timbre-transfer are comparing it to the libraries listed below:
- Codebase and project page for EDMSound ☆33 · Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer. ☆15 · Updated this week
- singing voice conversion without f0 ☆23 · Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆18 · Updated 3 months ago
- GPT for FACodec ☆13 · Updated 9 months ago
- Landing Page for All Things Source Separation ☆19 · Updated 2 months ago
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching" ☆27 · Updated last month
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆73 · Updated 2 weeks ago
- Implementation of Multi-Source Music Generation with Latent Diffusion. ☆20 · Updated 3 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ☆16 · Updated 2 weeks ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors ☆13 · Updated 2 weeks ago
- My vocoder experiments ☆25 · Updated 2 months ago
- ☆10 · Updated 2 months ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one ☆27 · Updated 5 months ago
- ☆12 · Updated last year
- PyTorch implementation of [AudioLCM]: an efficient and high-quality text-to-audio generation with latent consistency model. ☆10 · Updated 6 months ago
- Official source code of airsep ☆35 · Updated 9 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23] ☆17 · Updated last year
- Supervoice diffusion enhance ☆26 · Updated 5 months ago
- ☆62 · Updated 9 months ago
- AudioSR-Upsampling (any -> 48kHz) ☆38 · Updated 10 months ago
- An AR+AR TTS attempt. ☆13 · Updated 3 weeks ago
- ☆23 · Updated 2 months ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing ☆18 · Updated 4 months ago
- ☆26 · Updated 10 months ago
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion. ☆30 · Updated last year
- Project for MIDI to Audio Synthesis ☆22 · Updated last year
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT" ☆32 · Updated last week
- Viterbi decoding in PyTorch ☆27 · Updated 3 months ago