HKUDS / AI-CreatorLinks
"AI-Creator: Multi-Modal Agents for Video Production"
☆143Updated this week
Alternatives and similar repositories for AI-Creator
Users that are interested in AI-Creator are comparing it to the libraries listed below
Sorting:
- PodAgent: A Comprehensive Framework for Podcast Generation☆87Updated 3 weeks ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆167Updated 3 months ago
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning☆198Updated 2 months ago
- "EasyRec: Simple yet Effective Language Model for Recommendation"☆114Updated 3 months ago
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents☆362Updated 3 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆76Updated 2 weeks ago
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning☆206Updated 2 weeks ago
- An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social graphs.☆76Updated last month
- Implementing some features of Manus with MCP☆40Updated last month
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"☆130Updated 6 months ago
- [CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Gener…☆267Updated 2 months ago
- LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)☆211Updated last week
- project page for ChatAnyone☆108Updated 2 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆169Updated this week
- ☆55Updated 3 weeks ago
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆191Updated this week
- OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking☆452Updated last month
- "Hyper-RAG: Combating LLM Hallucinations using Hypergraph-Driven Retrieval-Augmented Generation" by Yifan Feng, Hao Hu, Xingliang Hou, Sh…☆128Updated last month
- [ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"☆73Updated 5 months ago
- ☆68Updated 8 months ago
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆212Updated last month
- ☆35Updated 6 months ago
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt☆266Updated last week
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆125Updated 7 months ago
- A open version Manus.☆58Updated 2 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆205Updated last month
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆146Updated 2 months ago
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.☆81Updated last month
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations☆204Updated last month
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆69Updated 4 months ago