yangdongchao / UniAudio_demo
The demo page of UniAudio
☆33 Updated last year
Alternatives and similar repositories for UniAudio_demo:
Users who are interested in UniAudio_demo are comparing it to the repositories listed below.
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆86 Updated 4 months ago
- Official Implementation of EnCLAP (ICASSP 2024) ☆91 Updated 11 months ago
- Unsupervised Rhythm Modeling for Voice Conversion ☆81 Updated last year
- VoiceLDM: Text-to-Speech with Environmental Context ☆175 Updated 8 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models. ☆76 Updated last year
- ☆108 Updated 3 months ago
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT ☆47 Updated last year
- ☆41 Updated 6 months ago
- ☆62 Updated 9 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆95 Updated 6 months ago
- Codebase and project page for EDMSound ☆34 Updated last year
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network ☆111 Updated last year
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23] ☆113 Updated last year
- ☆66 Updated last year
- Pytorch implementation of SoundCTM ☆93 Updated last month
- GPT for FACodec ☆13 Updated last year
- ☆40 Updated 2 months ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation ☆25 Updated 4 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆49 Updated last month
- ☆35 Updated last year
- Unofficial download repository for MusicCaps ☆47 Updated 2 years ago
- ☆59 Updated last year
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu… ☆83 Updated 8 months ago
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1… ☆29 Updated last year
- Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati… ☆182 Updated last year
- ☆41 Updated last year
- Official source codes of airsep ☆36 Updated last year
- ☆38 Updated 7 months ago
- ☆77 Updated 6 months ago
- small audio language model for reasoning ☆61 Updated 2 weeks ago