yangdongchao / UniAudio_demo
The demo page of UniAudio
☆33Updated last year
Alternatives and similar repositories for UniAudio_demo
Users who are interested in UniAudio_demo are comparing it to the repositories listed below.
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- ☆113Updated 3 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆76Updated last year
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆118Updated 2 months ago
- Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis☆39Updated last year
- Official Implementation of EnCLAP (ICASSP 2024)☆91Updated last year
- ☆66Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Updated 5 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆88Updated 5 months ago
- ☆39Updated 8 months ago
- ☆58Updated last year
- VoiceLDM: Text-to-Speech with Environmental Context☆177Updated 9 months ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆49Updated 2 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆58Updated last month
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- ☆25Updated last year
- Implementation of SoundStorm built upon SpeechTokenizer.☆112Updated last year
- Putting flows on top of neural transducers for better TTS☆62Updated last week
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- A trainer for SNAC (Multi-Scale Neural Audio Codec) that replaces the decoder with Vocos.☆52Updated 7 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆53Updated last year
- ☆41Updated 6 months ago
- ☆41Updated 2 years ago
- [ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT☆48Updated last year
- Temporary anonymous version☆22Updated last year
- ☆62Updated 10 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆58Updated 7 months ago
- ☆35Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 7 months ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated 2 years ago