StevenGrove / ComfyUI-YOLOWorldLinks
ComfyUI YOLO-World Integration
☆48Updated last year
Alternatives and similar repositories for ComfyUI-YOLOWorld
Users that are interested in ComfyUI-YOLOWorld are comparing it to the libraries listed below
Sorting:
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆143Updated 10 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆163Updated 5 months ago
- Image Prompter for Gradio☆92Updated last year
- A Gradio component that can be used to annotate images with bounding boxes.☆63Updated last month
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis.☆65Updated 11 months ago
- Official Repo For the [AAAI'26 Oral] Paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback”☆22Updated this week
- ☆58Updated 2 weeks ago
- Codebase for the Recognize Anything Model (RAM)☆87Updated 2 years ago
- Florence-2☆71Updated 9 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆134Updated last year
- ☆26Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year
- finetune your florence2 model easy☆19Updated last year
- Image Editing Anything☆116Updated 2 years ago
- A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design☆133Updated 6 months ago
- ☆208Updated last year
- ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback☆121Updated 2 months ago
- ☆16Updated 4 months ago
- VimTS: A Unified Video and Image Text Spotter☆79Updated last year
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting☆121Updated 11 months ago
- Try-On Master-1: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-wise Diffusion Transformer Framework☆138Updated 4 months ago
- Official repository of "FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring"☆27Updated last week
- Diffusers training with mmengine☆102Updated last year
- YOLO-World + EfficientViT SAM☆106Updated last year
- Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance☆75Updated 5 months ago
- Our 2nd-gen LMM☆34Updated last year
- ☆184Updated 4 months ago
- Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model☆240Updated 7 months ago
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆234Updated last year
- RepText: Rendering Visual Text via Replicating 🔥☆139Updated 6 months ago