yangdongchao / UniAudio_demo
The demo page of UniAudio
☆34Updated last year
Alternatives and similar repositories for UniAudio_demo:
Users that are interested in UniAudio_demo are comparing it to the libraries listed below
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆76Updated last month
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆110Updated last year
- VoiceLDM: Text-to-Speech with Environmental Context☆169Updated 6 months ago
- Official Implementation of EnCLAP (ICASSP 2024)☆90Updated 8 months ago
- ☆39Updated 3 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆44Updated 5 months ago
- ☆36Updated 4 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆79Updated 5 months ago
- Implementation of SoundStorm built upon SpeechTokenizer.☆108Updated last year
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆29Updated last year
- Pytorch implementation of SoundCTM☆81Updated this week
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆79Updated 10 months ago
- Zero-Shot Emotion Style Transfer☆41Updated 10 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆92Updated 4 months ago
- Codebase and project page for EDMSound☆34Updated last year
- Audiogen Codec☆131Updated 7 months ago
- Toward Universal Text-to-Music-Retrieval (TTMR) [ICASSP23]☆114Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆62Updated 3 months ago
- ☆41Updated last year
- ☆53Updated 7 months ago
- ☆33Updated last year
- The official Implementation of PeriodWave and PeriodWave-Turbo☆158Updated this week
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆39Updated last year
- ☆43Updated 8 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆66Updated 4 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆98Updated 3 weeks ago
- Official Implementation of StyleTTS-VC☆175Updated last month