StevenGrove / ComfyUI-YOLOWorld
ComfyUI YOLO-World Integration
☆42Updated 9 months ago
Alternatives and similar repositories for ComfyUI-YOLOWorld:
Users that are interested in ComfyUI-YOLOWorld are comparing it to the libraries listed below
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆135Updated 3 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆63Updated 8 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆119Updated 8 months ago
- Codebase for the Recognize Anything Model (RAM)☆78Updated last year
- ☆25Updated 10 months ago
- A simple tool to guess an HuggingFace repo URL from a state dict.☆40Updated 5 months ago
- ☆32Updated last year
- Diffusers training with mmengine☆100Updated last year
- Florence-2☆63Updated 2 months ago
- ☆22Updated 4 months ago
- Image Prompter for Gradio☆88Updated last year
- YOLO-World + EfficientViT SAM☆97Updated last year
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆240Updated 2 months ago
- ☆14Updated 3 months ago
- Official Pytorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation - NeurIPS 2024☆104Updated 4 months ago
- TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation☆58Updated 6 months ago
- Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance☆73Updated 2 months ago
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆23Updated 10 months ago
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type.☆64Updated 5 months ago
- A Gradio component that can be used to annotate images with bounding boxes.☆49Updated last month
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆49Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- Modern Stable Diffusion models family - Fluently☆30Updated 10 months ago
- Training InstructPi2Pix with SDXL.☆18Updated last year
- Unofficial implementation. Stable diffusion model trained by AI Feedback-Based Self-Training Direct Preference Optimization.☆63Updated last year
- This repository contains code for deploying a Gradio application using the SAM2 model for video processing. The application allows users …☆39Updated 7 months ago
- Stable Diffusion in TensorRT 8.5+☆14Updated 2 years ago
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis.☆62Updated 4 months ago
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆41Updated last year
- Fine-tuning code for CLIP models☆219Updated last month