haidog-yaqub / EzAudio
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
☆238 Updated last week
Related projects
Alternatives and complementary repositories for EzAudio
- ☆253 Updated 8 months ago
- Fine-tune Stable Audio Open with DiT ControlNet. ☆177 Updated 2 months ago
- Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations. ☆150 Updated this week
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability. ☆222 Updated 2 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion ☆159 Updated last month
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3 ☆365 Updated 2 months ago
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley artist that adds vivid, synchronized sound effects to your silent videos 😝 ☆466 Updated 3 months ago
- Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati… ☆156 Updated 7 months ago
- Awesome music generation model: MG² ☆112 Updated this week
- ☆307 Updated 2 months ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch ☆355 Updated 2 weeks ago
- The reproduced code for Google's SoundStorm ☆254 Updated last year
- ☆87 Updated 6 months ago
- OpenMusic: SOTA Text-to-music (TTM) Generation ☆479 Updated last week
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio. ☆125 Updated 5 months ago
- Interface for OuteTTS models. ☆406 Updated 2 weeks ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆84 Updated last month
- ☆139 Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆138 Updated 4 months ago
- Generative models for conditional audio generation ☆117 Updated this week
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei ☆134 Updated last year
- The official Implementation of PeriodWave and PeriodWave-Turbo ☆132 Updated 3 months ago
- ☆62 Updated last month
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili… ☆144 Updated 3 months ago
- Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation ☆353 Updated this week
- Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models ☆161 Updated 5 months ago
- ☆47 Updated 4 months ago
- VALL-E 2 reproduction ☆87 Updated 4 months ago
- text to speech using autoregressive transformer and VITS ☆227 Updated 7 months ago
- ☆176 Updated last month