ituvisionlab / EdVAE
Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"
☆10Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for EdVAE
- A neural speech codec based on discrete WavLM representations☆21Updated 2 months ago
- A spoken version of the textual story cloze benchmark☆14Updated last year
- ☆15Updated 4 months ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated 9 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- A toolkit dedicate for speech evaluation.☆18Updated last month
- ☆20Updated 10 months ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆15Updated 2 weeks ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆37Updated last month
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆22Updated 7 months ago
- Stable Audio UnOffical Implementation: Latent Diffusion for Audio Generation☆23Updated 9 months ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆21Updated 8 months ago
- Generation scripts for EARS-WHAM and EARS-Reverb☆23Updated 2 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 11 months ago
- Official code of ElasticAST (Interspeech 2024 paper)☆23Updated 3 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- An ODE-based generative neural vocoder using Rectified Flow☆61Updated last year
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆22Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆42Updated 3 weeks ago
- Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Updated last year
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆57Updated 2 months ago
- ☆47Updated last week
- WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆27Updated 3 weeks ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆48Updated 3 weeks ago
- Official repository of Wavehax vocoder☆28Updated last week
- ☆34Updated 5 months ago
- ESLTTS dataset☆16Updated 5 months ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆31Updated 10 months ago
- An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"☆28Updated last year