showlab / Code2VideoLinks
Video generation via code
☆56Updated this week
Alternatives and similar repositories for Code2Video
Users that are interested in Code2Video are comparing it to the libraries listed below
Sorting:
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"☆130Updated 10 months ago
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".☆185Updated 7 months ago
- The official GitHub Page for MiniMax☆55Updated 3 months ago
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024)☆307Updated this week
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization☆218Updated 5 months ago
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning☆247Updated 6 months ago
- [ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible☆102Updated last month
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning☆259Updated last week
- Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D…☆37Updated 8 months ago
- [ICCV2025] WikiAutoGen offical page☆18Updated 3 months ago
- [ECCV 2024] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models☆75Updated 11 months ago
- [EMNLP 2025 Demo] PresentAgent: Multimodal Agent for Presentation Video Generation☆100Updated 3 weeks ago
- Chrome / Edge extension to turn arXiv papers into Markdown codes in one click.☆83Updated 6 months ago
- Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)☆166Updated last year
- ☆124Updated 2 months ago
- Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation☆143Updated 11 months ago
- An AI-driven daily arXiv paper crawler, analyzer, and organizer tool, focusing on AIGC☆63Updated this week
- 🔥ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt☆295Updated 4 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated last year
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models☆279Updated 2 weeks ago
- (CVPR 2025) Code of "Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models"☆187Updated 6 months ago
- [CVPR 2025] EgoLife: Towards Egocentric Life Assistant☆332Updated 6 months ago
- Controllable Animation Video Generation with Large Models-based Multimodal Agents☆199Updated this week
- Official PyTorch implementation of TokenSet.☆123Updated 6 months ago
- ☆69Updated 11 months ago
- ☆183Updated 2 months ago
- Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆127Updated 2 months ago
- ☆42Updated last month
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis.☆64Updated 9 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆49Updated 7 months ago