tencent-ailab / SongGenerationLinks
☆734Updated last month
Alternatives and similar repositories for SongGeneration
Users that are interested in SongGeneration are comparing it to the libraries listed below
Sorting:
- A fundamental toolkit designed for music, song, and audio generation☆1,194Updated 3 months ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆781Updated last month
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动 而且同步的音效 😝☆623Updated last year
- ☆287Updated 2 months ago
- ☆457Updated 3 months ago
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆258Updated 2 months ago
- An Open-Sourced LLM-empowered Foundation TTS System☆769Updated 3 months ago
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆251Updated last month
- ☆310Updated 5 months ago
- EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation☆453Updated last week
- ☆295Updated last year
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆383Updated 2 months ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆559Updated 3 months ago
- Repository of AudioX☆1,073Updated 4 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆308Updated 2 months ago
- Fork of ACE-Step for LoRA training with < 10 GB VRAM☆35Updated 3 weeks ago
- MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting…☆943Updated 2 weeks ago
- OpenMusic: SOTA Text-to-music (TTM) Generation☆609Updated 2 months ago
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆1,296Updated 2 weeks ago
- The showcase page of IndexTTS2☆122Updated 2 months ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆359Updated last month
- YuE: Open Full-song Generation Foundation for the GPU Poor☆437Updated 7 months ago
- ☆445Updated 4 months ago
- Added vLLM support to IndexTTS for faster inference.☆509Updated this week
- Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…☆1,036Updated this week
- ☆1,027Updated 4 months ago
- Awesome music generation model——MG²☆159Updated 5 months ago
- ☆515Updated 3 weeks ago
- MaskGCT-Windows For Windows Users☆66Updated 3 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆445Updated 3 weeks ago