StevenGrove / ComfyUI-YOLOWorldLinks
ComfyUI YOLO-World Integration
☆42Updated 11 months ago
Alternatives and similar repositories for ComfyUI-YOLOWorld
Users that are interested in ComfyUI-YOLOWorld are comparing it to the libraries listed below
Sorting:
- Official repo for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆129Updated 3 weeks ago
- ☆22Updated 5 months ago
- Florence-2☆67Updated 3 months ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆135Updated 4 months ago
- Image Editing Anything☆114Updated 2 years ago
- A Gradio component that can be used to annotate images with bounding boxes.☆52Updated 3 months ago
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis.☆63Updated 5 months ago
- Codebase for the Recognize Anything Model (RAM)☆79Updated last year
- ☆25Updated last year
- This repository is the official implementation of FLUX-CustomID. It is capable of generating images based on your face image at a level e…☆21Updated 6 months ago
- ☆15Updated 5 months ago
- Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance☆75Updated 4 months ago
- finetune your florence2 model easy☆18Updated 10 months ago
- Our 2nd-gen LMM☆33Updated last year
- ☆84Updated 9 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆86Updated last year
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting☆119Updated 5 months ago
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging☆43Updated last month
- Finetuning and inference tools for the CogView4 and CogVideoX model series.☆70Updated 3 weeks ago
- Image Prompter for Gradio☆89Updated last year
- A simple tool to guess an HuggingFace repo URL from a state dict.☆43Updated 7 months ago
- official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark☆37Updated last year
- YOLO-World + EfficientViT SAM☆98Updated last year
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes☆65Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆34Updated 2 weeks ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆124Updated 9 months ago
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"☆59Updated 2 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆64Updated 9 months ago
- MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models☆111Updated 2 weeks ago
- An official implementation of SwapAnyone.☆62Updated 2 months ago