Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'
☆20Jul 24, 2024Updated last year
Alternatives and similar repositories for ee-diffusion
Users who are interested in ee-diffusion are comparing it to the libraries listed below
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- My vocoder experiments☆31Jul 26, 2025Updated 7 months ago
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Jun 18, 2025Updated 8 months ago
- Official implementation of OSSGAN [CVPR 2022]☆21May 2, 2022Updated 3 years ago
- Official implementation for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 8 months ago
- ☆49Apr 1, 2025Updated 11 months ago
- ☆10Apr 8, 2024Updated last year
- List of papers about TTS☆10Dec 16, 2017Updated 8 years ago
- Official code for SongEcho☆41Feb 21, 2026Updated last week
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆61Jul 1, 2025Updated 8 months ago
- ☆54Mar 2, 2023Updated 3 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- Use quantized versions of Whisper to speed up inference☆12Oct 16, 2024Updated last year
- ☆61Oct 28, 2024Updated last year
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆93Mar 12, 2025Updated 11 months ago
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".☆64Nov 5, 2025Updated 3 months ago
- ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Nov 16, 2025Updated 3 months ago
- BigVGAN with Neural Source-Filter☆56Sep 21, 2023Updated 2 years ago
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024☆35Oct 31, 2025Updated 4 months ago
- A neural network, written in TensorFlow, for filtering a target speaker's voice from audio☆21Jun 21, 2018Updated 7 years ago
- [InterSpeech'2024] FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆59Oct 23, 2024Updated last year
- Training code and trained checkpoints for ASGAN.☆62Dec 27, 2023Updated 2 years ago
- Simple torch.nn.module implementation of Alias-Free-GAN style filter and resample☆99Jul 26, 2022Updated 3 years ago
- ☆23Jun 13, 2023Updated 2 years ago
- The Multi-band Excited WaveNet☆15Feb 2, 2023Updated 3 years ago
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated last year
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆17May 20, 2025Updated 9 months ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- ☆99Jan 19, 2026Updated last month
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- ☆18Jan 17, 2022Updated 4 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- Try to replicate the architecture of MiniMaxTTS mentioned in its technical report☆49Sep 2, 2025Updated 6 months ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆76Jan 25, 2026Updated last month
- [ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling☆78Sep 28, 2025Updated 5 months ago
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆53Updated this week
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago