declare-lab / TangoFlux
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
☆560 Updated this week
Alternatives and similar repositories for TangoFlux:
Users who are interested in TangoFlux are comparing it to the repositories listed below:
- [arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis ☆966 Updated this week
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝 ☆507 Updated 5 months ago
- Diffusion-based Portrait and Animal Animation ☆608 Updated this week
- Interface for OuteTTS models. ☆859 Updated this week
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆252 Updated 2 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation ☆664 Updated last month
- OpenMusic: SOTA Text-to-music (TTM) Generation ☆502 Updated 2 weeks ago
- Implementation of "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation" ☆556 Updated last month
- Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration ☆585 Updated 3 months ago
- InspireMusic: A Unified Framework for Music, Song, Audio Generation. ☆305 Updated 3 weeks ago
- This repository is the official implementation of "DisPose: Disentangling Pose Guidance for Controllable Human Image Animation" ☆302 Updated last week
- Implementation of F5-TTS in MLX ☆429 Updated last week
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". ☆863 Updated 2 months ago
- Source code for the SIGGRAPH 2024 paper "X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention" ☆448 Updated 5 months ago
- Taming Stable Diffusion for Lip Sync! ☆1,856 Updated this week
- Official Implementations for Paper - AniDoc: Animation Creation Made Easier ☆443 Updated 2 weeks ago
- ☆661 Updated this week
- StoryMaker: Towards consistent characters in text-to-image generation ☆628 Updated last month
- Bring portraits to life in real time! ONNX/TensorRT support! Real-time portrait animation! ☆606 Updated this week
- Gradio WebUI for AdvancedLivePortrait ☆416 Updated last week
- Learning Flow Fields in Attention for Controllable Person Image Generation ☆944 Updated last week
- AuraSR: GAN-based Super-Resolution for real-world ☆425 Updated 2 months ago
- ☆338 Updated last month
- A Fast TTS Engine ☆405 Updated last week
- Animate-X - PyTorch Implementation ☆298 Updated last month
- The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization" ☆337 Updated 3 weeks ago
- ☆419 Updated last month
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models ☆262 Updated last week
- ☆354 Updated 2 months ago
- ☆293 Updated 6 months ago