declare-lab / TangoFluxLinks
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
☆772Updated 3 weeks ago
Alternatives and similar repositories for TangoFlux
Users that are interested in TangoFlux are comparing it to the libraries listed below
Sorting:
- ☆706Updated 2 weeks ago
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝☆618Updated last year
- A fundamental toolkit designed for music, song, and audio generation☆1,182Updated 3 months ago
- ☆514Updated last week
- Repository of AudioX☆1,065Updated 3 months ago
- OpenMusic: SOTA Text-to-music (TTM) Generation☆606Updated 2 months ago
- ☆751Updated 6 months ago
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,801Updated last week
- Diffusion-based Portrait and Animal Animation☆822Updated 5 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆308Updated last month
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,158Updated 2 months ago
- YuE: Open Full-song Generation Foundation for the GPU Poor☆433Updated 6 months ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆352Updated 2 weeks ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,504Updated this week
- ☆454Updated 3 months ago
- ☆1,023Updated 3 months ago
- ☆1,800Updated 2 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation☆1,052Updated 2 weeks ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆559Updated 2 months ago
- ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…☆375Updated this week
- Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait☆271Updated 3 weeks ago
- Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"☆845Updated 6 months ago
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆255Updated last month
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆603Updated 4 months ago
- Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!☆957Updated last month
- Select a portrait, click to move the head around (please use your own space / GPU!)☆893Updated last week
- ☆277Updated last month
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework☆780Updated last month
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆185Updated this week
- ☆441Updated 3 months ago