index-tts / index-tts2.github.io

The showcase page of IndexTTS2

☆179

Alternatives and similar repositories for index-tts2.github.io

Users who are interested in index-tts2.github.io are comparing it to the repositories listed below.

vivoCameraResearch / Magic-TryOn
MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.
☆512 Updated 2 weeks ago
antonibigata / keysync
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution
☆376 Updated 2 weeks ago
DecartAI / Lucy-Edit-ComfyUI
☆716 Updated 3 months ago
AIDC-AI / Pixelle-MCP
An Open-Source Multimodal AIGC Solution based on ComfyUI + MCP + LLM https://pixelle.ai
☆911 Updated last month
HumanAIGC / omnitalker
[NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication
☆418 Updated 4 months ago
univa-agent / univa
Official Code Repo for UniVA: Universal Video Agents
☆343 Updated 2 weeks ago
HumanAIGC / chat-anyone
project page for ChatAnyone
☆116 Updated 10 months ago
toto222 / DICE-Talk
DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…
☆292 Updated 6 months ago
stepfun-ai / Step-Audio-EditX
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…
☆870 Updated this week
MYZY-AI / Muyan-TTS
☆474 Updated 8 months ago
playht / PlayDiffusion
☆537 Updated 4 months ago
TencentARC / ToonComposer
[ICLR 2026] Streamlining Cartoon Production with Generative Post-Keyframing
☆541 Updated 5 months ago
oneCodeSuperman / LstmSync
The open-source LstmSync generalizable digital-human model, focused solely on being the best generalization model!
☆140 Updated this week
yuyou-dev / Vibe-Agent
Hand-built Agent series: unorthodox Banana Pro applications and local deployment of Gemini
☆384 Updated last month
Kevin-thu / StoryMem
Official code for StoryMem: Multi-shot Long Video Storytelling with Memory
☆644 Updated 3 weeks ago
antgroup / echomimic_v3
[AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation
☆755 Updated last week
FunAudioLLM / Fun-Audio-Chat
Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.
☆835 Updated 2 weeks ago
ssj9596 / One-to-All-Animation
One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
☆436 Updated last month
maitrix-org / Voila
☆486 Updated 9 months ago
Alibaba-Quark / LiveAvatar
Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"
☆1,750 Updated 2 weeks ago
zai-org / GLM-TTS
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
☆923 Updated last month
FotographerAI / ZenCtrl
In-context subject-driven image generation while preserving foreground fidelity
☆351 Updated 8 months ago
SkyworkAI / skyreels-a3.github.io
project for skyreels-a3
☆78 Updated 6 months ago
deepbrainai-research / float
[ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.
☆457 Updated 3 months ago
WeChatCV / Stand-In
Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.
☆725 Updated last month
Fantasy-AMAP / fantasy-portrait
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers
☆501 Updated 5 months ago
HKoon / ChatTTS-OpenVoice
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
☆459 Updated last year
X-PLUG / MM_StoryAgent
☆301 Updated last year
1230young / bizgen
[CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Gener…
☆299 Updated 10 months ago
declare-lab / TangoFlux
[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
☆831 Updated 2 weeks ago