declare-lab / TangoFluxLinks
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
☆781Updated last month
Alternatives and similar repositories for TangoFlux
Users that are interested in TangoFlux are comparing it to the libraries listed below
Sorting:
- A fundamental toolkit designed for music, song, and audio generation☆1,194Updated 3 months ago
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝☆626Updated last year
- ☆734Updated last month
- ☆516Updated 3 weeks ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆308Updated 2 months ago
- ☆457Updated 3 months ago
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,843Updated 3 weeks ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆359Updated last month
- Diffusion-based Portrait and Animal Animation☆828Updated 6 months ago
- ☆753Updated 6 months ago
- Repository of AudioX☆1,073Updated 4 months ago
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,173Updated 3 months ago
- OpenMusic: SOTA Text-to-music (TTM) Generation☆609Updated 2 months ago
- YuE: Open Full-song Generation Foundation for the GPU Poor☆437Updated 7 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆613Updated 5 months ago
- ☆1,027Updated 4 months ago
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆383Updated 2 months ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,546Updated 3 weeks ago
- [IJCV] Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait☆279Updated last week
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆559Updated 3 months ago
- Interface for OuteTTS models.☆1,375Updated 2 months ago
- ☆287Updated 2 months ago
- ☆447Updated 4 months ago
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆251Updated last month
- [ICLR2025] DisPose: Disentangling Pose Guidance for Controllable Human Image Animation☆371Updated 7 months ago
- ☆1,841Updated 2 months ago
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆1,558Updated 3 weeks ago
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆258Updated 2 months ago
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework☆792Updated 2 months ago
- ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…☆386Updated 3 weeks ago