xxyQwQ / ComfyBench
Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".
☆135 Updated 3 weeks ago
Alternatives and similar repositories for ComfyBench:
Users who are interested in ComfyBench are comparing it to the repositories listed below.
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization ☆186 Updated this week
- 🔥 CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models ☆203 Updated 7 months ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines ☆114 Updated 3 months ago
- A diffusers pipeline for zero shot stylised couples portrait creation ☆100 Updated 2 months ago
- All-round Creator and Editor ☆188 Updated last month
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance" ☆124 Updated 3 months ago
- SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2 ☆112 Updated 3 months ago
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis. ☆60 Updated last month
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type. ☆53 Updated 3 months ago
- ☆173 Updated 7 months ago
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought ☆207 Updated last month
- [IJCV'24] AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort ☆146 Updated 2 months ago
- latent Diffusion Image generation tests ☆52 Updated this week
- An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community … ☆58 Updated last week
- ☆209 Updated 6 months ago
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model. ☆182 Updated 6 months ago
- The best OSS video generation models ☆131 Updated 3 months ago
- The project page of Diffutoon ☆27 Updated last year
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model ☆236 Updated 6 months ago
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data. ☆212 Updated last week
- Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D… ☆34 Updated 2 weeks ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation. ☆121 Updated this week
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training) ☆131 Updated 3 weeks ago
- Official repository of "TryOffAnyone: Tiled Cloth Generation from a Dressed Person" ☆145 Updated 2 weeks ago
- UniPortrait: A Unified Framework for Identity-Preserving Single- and Multi-Human Image Personalization ☆233 Updated 4 months ago
- ☆197 Updated last year
- AuraSR in ComfyUI for img & video ☆89 Updated 7 months ago
- ☆29 Updated 3 weeks ago