ZeyueT / AudioX
☆786Updated last week
Alternatives and similar repositories for AudioX
Users who are interested in AudioX are comparing it to the libraries listed below
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆717Updated 2 months ago
- YuE: Open Full-song Generation Foundation for the GPU Poor☆385Updated 3 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆1,766Updated this week
- FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,073Updated this week
- InspireMusic: A Unified Framework for Music, Song, Audio Generation.☆1,086Updated this week
- ☆742Updated 2 months ago
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI Foley master: add vivid, synchronized sound effects to your silent videos 😝☆578Updated 9 months ago
- ☆862Updated 3 weeks ago
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,486Updated this week
- Official implementations for paper: VACE: All-in-One Video Creation and Editing☆1,481Updated 3 weeks ago
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion☆1,565Updated this week
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆862Updated 3 weeks ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆328Updated this week
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆269Updated 3 weeks ago
- 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆993Updated 3 weeks ago
- Memory-Guided Diffusion for Expressive Talking Video Generation☆813Updated 3 months ago
- Interface for OuteTTS models.☆1,214Updated 2 weeks ago
- OpenMusic: SOTA Text-to-music (TTM) Generation☆559Updated 2 weeks ago
- NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms☆968Updated 3 weeks ago
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework☆688Updated 3 weeks ago
- Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields☆786Updated 2 weeks ago
- ☆463Updated 2 weeks ago
- The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped No…☆882Updated this week
- ☆446Updated 2 months ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,399Updated 3 weeks ago
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models☆1,241Updated 2 weeks ago
- Image editing is worth a single LoRA! 0.1% training data and 1% training parameters for fantastic image editing! Surpasses GPT-4o in ID p…☆1,067Updated this week
- LTX-Video Support for ComfyUI☆1,214Updated this week
- HunyuanVideo GP: Large Video Generation Model - GPU Poor version☆408Updated last month
- ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control (e.g., au…☆253Updated 3 weeks ago