Bai-YT / ConsistencyTTA
ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
☆27 Updated 3 months ago
Related projects:
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆41 Updated last week
- Codebase and project page for EDMSound ☆29 Updated 10 months ago
- ☆78 Updated last year
- [Official Implementation] Acoustic Autoregressive Modeling 🔥 ☆52 Updated 3 weeks ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆29 Updated 8 months ago
- ☆33 Updated 2 months ago
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency ☆45 Updated 2 months ago
- ☆21 Updated last year
- Stable Audio Unofficial Implementation: Latent Diffusion for Audio Generation ☆23 Updated 7 months ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning' ☆81 Updated last month
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆15 Updated last month
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial… ☆38 Updated last week
- Official Implementation of EnCLAP (ICASSP 2024) ☆88 Updated 3 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu… ☆65 Updated 2 weeks ago
- Implementation of Multi-Source Music Generation with Latent Diffusion. ☆13 Updated last week
- ☆59 Updated 5 months ago
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation ☆23 Updated 6 months ago
- ☆18 Updated 4 months ago
- ☆37 Updated 3 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation" ☆21 Updated 5 months ago
- Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa… ☆14 Updated last year
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion. ☆28 Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT ☆36 Updated 3 weeks ago
- Unofficial download repository for MusicCaps ☆41 Updated last year
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models) ☆28 Updated 3 months ago
- [ACL 2024] This is the PyTorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing" ☆32 Updated last month
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi… ☆33 Updated last month
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners" ☆28 Updated last year
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model ☆50 Updated 10 months ago
- ☆41 Updated 2 months ago