yangdongchao / UniAudio_demo
The demo page of UniAudio
☆34Updated 7 months ago
Related projects: ⓘ
- Unsupervised Rhythm Modeling for Voice Conversion☆78Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆74Updated 2 months ago
- The official GitHub page for the survey paper "Foundation Models for Music: A Survey".☆79Updated 2 weeks ago
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…☆111Updated last month
- PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.☆169Updated 3 weeks ago
- The official Implementation of PeriodWave and PeriodWave-Turbo☆107Updated last month
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆111Updated last year
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆65Updated 2 weeks ago
- Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…☆141Updated 5 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆68Updated 2 months ago
- VoiceLDM: Text-to-Speech with Environmental Context☆157Updated last month
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆27Updated 8 months ago
- Zero-Shot Emotion Style Transfer☆33Updated 5 months ago
- Official Implementation of EnCLAP (ICASSP 2024)☆88Updated 3 months ago
- ☆61Updated last month
- ☆59Updated 5 months ago
- Codebase and project page for EDMSound☆29Updated 9 months ago
- VALL-E 2 reproduction☆72Updated 2 months ago
- ☆54Updated last month
- Transcribing Speech with Multinomial Diffusion, training code and models.☆74Updated 11 months ago
- Audiogen Codec☆116Updated 2 months ago
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆109Updated 8 months ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆38Updated last year
- Pytorch implementation of SoundCTM☆68Updated 3 weeks ago
- Official source codes of airsep☆33Updated 5 months ago
- The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".☆26Updated last week
- AudioBench: A Universal Benchmark for Audio Large Language Models☆61Updated 2 weeks ago
- Finetuning VITS Efficiently☆31Updated 10 months ago
- ☆44Updated this week
- Unofficial download repository for MusicCaps☆41Updated last year