HKUDS / Agentic-AIGCLinks
"Agentic-AIGC: One Prompt → Video Creation: AI Unleashed"
☆259Updated last week
Alternatives and similar repositories for Agentic-AIGC
Users that are interested in Agentic-AIGC are comparing it to the libraries listed below
Sorting:
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning☆249Updated 6 months ago
- [CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Gener…☆294Updated 6 months ago
- project page for ChatAnyone☆113Updated 6 months ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆186Updated 7 months ago
- ☆284Updated last year
- PodAgent: A Comprehensive Framework for Podcast Generation☆118Updated 4 months ago
- EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation☆548Updated last month
- MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.☆464Updated last month
- LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)☆274Updated last month
- The showcase page of IndexTTS2☆165Updated 3 weeks ago
- [NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication☆382Updated 3 weeks ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆103Updated last week
- ☆356Updated 6 months ago
- [ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction☆332Updated 6 months ago
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning☆658Updated last week
- ☆1,886Updated 3 months ago
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt☆296Updated 4 months ago
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portrai…☆258Updated 2 months ago
- talking-face video editing☆383Updated 7 months ago
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents☆382Updated 8 months ago
- Cook up amazing multimodal AI applications effortlessly with MiniCPM-o☆209Updated this week
- ☆1,681Updated 2 months ago
- [EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking☆460Updated last month
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning☆263Updated 2 weeks ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers☆561Updated 4 months ago
- ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…☆410Updated last month
- AI model that understands text & humanoids.☆126Updated 4 months ago
- ☆623Updated 2 months ago
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"☆130Updated 10 months ago
- ☆457Updated 5 months ago