HKUDS / AI-Creator
"AI-Creator: Multi-Modal Agents for Video Production"
☆157 Updated 2 weeks ago
Alternatives and similar repositories for AI-Creator
Users who are interested in AI-Creator are comparing it to the repositories listed below.
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems". ☆173 Updated 3 months ago
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning ☆209 Updated 3 months ago
- PodAgent: A Comprehensive Framework for Podcast Generation ☆93 Updated last month
- project page for ChatAnyone ☆109 Updated 3 months ago
- "EasyRec: Simple yet Effective Language Model for Recommendation" ☆114 Updated 4 months ago
- ☆35 Updated 6 months ago
- ☆255 Updated 10 months ago
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents ☆370 Updated 4 months ago
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning ☆217 Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization" ☆78 Updated last month
- [CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Gener… ☆276 Updated 2 months ago
- All-round Creator and Editor ☆223 Updated 5 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents ☆206 Updated last week
- ☆180 Updated 3 weeks ago
- LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025) ☆229 Updated last week
- Efficient Agent Training for Computer Use ☆106 Updated 3 weeks ago
- CursorCore: Assist Programming through Aligning Anything ☆125 Updated 4 months ago
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt ☆271 Updated 3 weeks ago
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator" ☆75 Updated this week
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation ☆251 Updated 4 months ago
- ☆273 Updated 3 weeks ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data. ☆242 Updated 4 months ago
- ☆26 Updated 2 weeks ago
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model. ☆186 Updated 11 months ago
- The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing" ☆112 Updated last week
- ☆77 Updated 2 months ago
- ☆41 Updated this week
- MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer. ☆309 Updated this week
- FlexRAG: A RAG Framework for Information Retrieval and Generation. ☆183 Updated last week
- ☆336 Updated 3 months ago