HKUDS / AI-CreatorLinks
"AI-Creator: Multi-Modal Agents for Video Production"
☆185Updated last week
Alternatives and similar repositories for AI-Creator
Users that are interested in AI-Creator are comparing it to the libraries listed below
Sorting:
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning☆228Updated 4 months ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆180Updated 5 months ago
- "Vimo: Chat with Your Videos"☆823Updated last week
- Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents☆374Updated 6 months ago
- GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆309Updated last month
- [CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Gener…☆289Updated 4 months ago
- "GraphAgent: Agentic Graph Language Assistant"☆309Updated 6 months ago
- PodAgent: A Comprehensive Framework for Podcast Generation☆113Updated 2 months ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆160Updated 4 months ago
- PresentAgent: Multimodal Agent for Presentation Video Generation☆91Updated last week
- OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking☆454Updated 3 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆136Updated 2 months ago
- ☆260Updated 11 months ago
- ☆37Updated 8 months ago
- project page for ChatAnyone☆111Updated 4 months ago
- SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.☆91Updated 3 months ago
- 🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents☆1,045Updated last week
- "Hyper-RAG: Combating LLM Hallucinations using Hypergraph-Driven Retrieval-Augmented Generation" by Yifan Feng, Hao Hu, Xingliang Hou, Sh…☆163Updated this week
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆205Updated last month
- LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)☆249Updated 3 weeks ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆80Updated 2 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆214Updated last month
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]☆223Updated last month
- MemoryOS is designed to provide a memory operating system for personalized AI agents.☆581Updated this week
- mem1是mem0的魔改版本。我的魔改能让它生成效果更可用和更适合做情感陪伴项目☆28Updated 8 months ago
- ☆288Updated 2 months ago
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning☆239Updated last month
- ☆63Updated 2 months ago
- A open version Manus.☆62Updated 4 months ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2☆125Updated 8 months ago