Cypress-Yang / SongBloom
☆39 Updated this week
Alternatives and similar repositories for SongBloom
Users who are interested in SongBloom are comparing it to the repositories listed below
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching ☆60 Updated 2 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆94 Updated 6 months ago
- Official implementation for FlowSep ☆52 Updated 5 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE ☆72 Updated last year
- Implementation of Multi-Source Music Generation with Latent Diffusion. ☆24 Updated 9 months ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024) ☆38 Updated last year
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching" ☆70 Updated 5 months ago
- ☆50 Updated 2 months ago
- Codebase and project page for EDMSound ☆34 Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions" ☆26 Updated last month
- Variable Bitrate Residual Vector Quantization for Audio Coding ☆46 Updated last month
- ☆40 Updated 4 months ago
- ☆114 Updated 4 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024) ☆73 Updated 7 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆49 Updated 3 months ago
- Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting" ☆43 Updated last month
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis ☆35 Updated 7 months ago
- AudioSR-Upsampling (any -> 48kHz) ☆41 Updated last year
- Official repository of Wavehax vocoder ☆52 Updated 6 months ago
- An AR+AR TTS attempt. ☆16 Updated 5 months ago
- E2E TTS using Conditional Flow Matching (Experimental*) ☆69 Updated last year
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025 ☆30 Updated 4 months ago
- SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning ☆28 Updated this week
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers ☆99 Updated last month
- small audio language model for reasoning ☆64 Updated 2 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching ☆36 Updated 4 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization" ☆82 Updated 3 weeks ago
- ☆44 Updated 7 months ago
- ☆103 Updated 7 months ago
- ☆44 Updated last year