AgentCooper2002 / EDMSound
Codebase and project page for EDMSound
☆29Updated last year
Related projects
Alternatives and complementary repositories for EDMSound
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆18Updated 2 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆66Updated last week
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆48Updated 3 weeks ago
- ☆32Updated 2 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆42Updated 2 months ago
- ☆34Updated 5 months ago
- ☆23Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆42Updated 4 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆31Updated 10 months ago
- GPT for FACodec☆13Updated 7 months ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆28Updated 3 weeks ago
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆57Updated 2 months ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless, and TortoiseTTS into one☆27Updated 3 months ago
- Zero-Shot Emotion Style Transfer☆37Updated 7 months ago
- ☆61Updated 7 months ago
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 9 months ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆66Updated last year
- ☆45Updated last month
- Temporary anonymous version☆22Updated 8 months ago
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆19Updated 2 weeks ago
- ☆34Updated 7 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆15Updated last month
- ☆40Updated 5 months ago
- Official repository of Wavehax vocoder☆20Updated last week
- ☆25Updated 3 months ago
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆35Updated last year
- ☆49Updated 3 weeks ago
- A spoken version of the textual story cloze benchmark☆14Updated last year