HKUDS / Agentic-AIGC

"Agentic-AIGC: One Prompt → Video Creation: AI Unleashed"

☆259

Alternatives and similar repositories for Agentic-AIGC

Users who are interested in Agentic-AIGC are comparing it to the libraries listed below.

showlab / MovieAgent
MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning
☆249 Updated 6 months ago
1230young / bizgen
[CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Gener…
☆294 Updated 6 months ago
HumanAIGC / chat-anyone
project page for ChatAnyone
☆113 Updated 6 months ago
xxyQwQ / ComfyBench
Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".
☆186 Updated 7 months ago
X-PLUG / MM_StoryAgent
☆284 Updated last year
yujxx / PodAgent
PodAgent: A Comprehensive Framework for Podcast Generation
☆118 Updated 4 months ago
antgroup / echomimic_v3
EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation
☆548 Updated last month
vivoCameraResearch / Magic-TryOn
MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.
☆464 Updated last month
showlab / livecc
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
☆274 Updated last month
index-tts / index-tts2.github.io
The showcase page of IndexTTS2
☆165 Updated 3 weeks ago
HumanAIGC / omnitalker
[NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication
☆382 Updated 3 weeks ago
AIGeeksGroup / PresentAgent
[EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation
☆103 Updated last week
stepfun-ai / Step-Video-TI2V
☆356 Updated 6 months ago
TencentARC / AnimeGamer
[ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction
☆332 Updated 6 months ago
Phantom-video / HuMo
HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning
☆658 Updated last week
Tencent-Hunyuan / HunyuanVideo-Avatar
☆1,886 Updated 3 months ago
byliutao / 1Prompt1Story
🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
☆296 Updated 4 months ago
toto222 / DICE-Talk
DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…
☆258 Updated 2 months ago
JOY-MM / JoyGen
talking-face video editing
☆383 Updated 7 months ago
AriaUI / Aria-UI
Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents
☆382 Updated 8 months ago
OpenSQZ / MiniCPM-V-CookBook
Cook up amazing multimodal AI applications effortlessly with MiniCPM-o
☆209 Updated this week
Omni-Avatar / OmniAvatar
☆1,681 Updated 2 months ago
zjunlp / OmniThink
[EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
☆460 Updated last month
yeliudev / VideoMind
💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning
☆263 Updated 2 weeks ago
SkyworkAI / SkyReels-A1
SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers
☆561 Updated 4 months ago
harlanhong / ACTalker
ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…
☆410 Updated last month
mdsrqbl / omnihuman
AI model that understands text & humanoids.
☆126 Updated 4 months ago
cangcz / AnchorCrafter
☆623 Updated 2 months ago
farewellthree / PPLLaVA
Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"
☆130 Updated 10 months ago
maitrix-org / Voila
☆457 Updated 5 months ago