xxyQwQ / ComfyBench
Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".
☆159 Updated last month
Alternatives and similar repositories for ComfyBench:
Users who are interested in ComfyBench are comparing it to the libraries listed below.
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization ☆207 Updated this week
- 🔥 ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt ☆237 Updated 2 weeks ago
- [CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Gener… ☆239 Updated last week
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning ☆171 Updated 3 weeks ago
- ☆175 Updated 9 months ago
- All-round Creator and Editor ☆212 Updated 3 months ago
- ☆273 Updated 3 weeks ago
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis. ☆62 Updated 3 months ago
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning ☆166 Updated last week
- The best OSS video generation models ☆132 Updated 5 months ago
- The official implementation of "MagicColor: Multi-Instance Sketch Colorization" ☆93 Updated last week
- 🔥 CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models ☆209 Updated 9 months ago
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation ☆234 Updated 2 months ago
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model ☆238 Updated 8 months ago
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation ☆58 Updated 6 months ago
- ☆223 Updated 8 months ago
- Deep Reasoning Translation via Reinforcement Learning (arXiv preprint 2025); DRT: Deep Reasoning Translation via Long Chain-of-Thought (a… ☆214 Updated this week
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort ☆150 Updated 4 months ago
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model. ☆183 Updated 8 months ago
- UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization ☆245 Updated 6 months ago
- ☆198 Updated last year
- Official implementation of the paper "MusicInfuser: Making Video Diffusion Listen and Dance" ☆63 Updated last week
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance" ☆127 Updated 4 months ago
- Official code of "MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation" ☆171 Updated 2 weeks ago
- Pusa: Thousands Timesteps Video Diffusion Model ☆125 Updated this week
- ☆473 Updated 4 months ago
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video … ☆116 Updated 3 weeks ago
- Any-length Video Inpainting and Editing with Plug-and-Play Context Control ☆326 Updated last week
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences ☆295 Updated 8 months ago
- The project page of Diffutoon ☆26 Updated last year