HKUDS / ViMax
"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"
☆667 Updated 2 weeks ago
Alternatives and similar repositories for ViMax
Users who are interested in ViMax are comparing it to the repositories listed below.
- [CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Gener… ☆293 Updated 7 months ago
- ☆289 Updated last year
- MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer. ☆481 Updated 3 months ago
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning ☆262 Updated 7 months ago
- Video generation via code ☆901 Updated last week
- Lemon AI is the first Full-stack Open-source Self-Evolving General AI Agent, offering a fully local alternative to Agentic platforms like… ☆1,290 Updated last week
- An Open-Source Multimodal AIGC Solution based on ComfyUI + MCP + LLM https://pixelle.ai ☆782 Updated 2 weeks ago
- [NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication ☆394 Updated 2 months ago
- Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video ge… ☆1,088 Updated this week
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation ☆1,192 Updated last month
- The showcase page of IndexTTS2 ☆171 Updated 2 months ago
- HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning ☆872 Updated last month
- Project page for ChatAnyone ☆115 Updated 7 months ago
- ☆1,156 Updated 2 weeks ago
- ☆1,933 Updated last month
- ☆1,723 Updated 3 months ago
- A Training-free Iterative Framework for Long Story Visualization ☆929 Updated 10 months ago
- 🔥 ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt ☆306 Updated last month
- ☆670 Updated 2 weeks ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers ☆568 Updated 5 months ago
- AI-Powered Video Retrieval & Clipping Tool ☆363 Updated 3 months ago
- A simple agent framework that's capable of browser use + mcp + auto instrument + plan + deep research + more ☆328 Updated last month
- ☆362 Updated 8 months ago
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation ☆3,109 Updated 2 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆677 Updated 5 months ago
- [AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation ☆618 Updated last week
- LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025) ☆304 Updated 3 weeks ago
- In-context subject-driven image generation while preserving foreground fidelity ☆351 Updated 5 months ago
- [EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking ☆466 Updated 2 months ago
- ☆634 Updated 3 months ago