A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integration of powerful object detection and segmentation models, offering an easy-to-use interface for developers seeking efficient image analysis capabilities without complex setups.
☆66Sep 30, 2024Updated last year
Alternatives and similar repositories for grounded-segmentation
Users that are interested in grounded-segmentation are comparing it to the libraries listed below
Sorting:
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- ☆16Apr 23, 2024Updated last year
- ☆33Nov 4, 2024Updated last year
- ☆19Jul 11, 2024Updated last year
- A diffusers pipeline for zero shot stylised couples portrait creation☆100Dec 10, 2024Updated last year
- ☆33Aug 9, 2024Updated last year
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆16Aug 30, 2024Updated last year
- Custom Background Remover for ComfyUI to address some issues I've encountered with different background removers.☆49Aug 8, 2025Updated 6 months ago
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)☆52Jan 14, 2026Updated last month
- Create Latents with Perlin Noise in any shape (dimensionality). Works with Flux, SD3 and other 16d latent models.☆34Aug 6, 2024Updated last year
- ☆17Sep 1, 2024Updated last year
- ☆20Jun 26, 2024Updated last year
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models☆35May 10, 2025Updated 9 months ago
- Simple LaMa Inpainting: An easy-to-use implementation of the LaMa (Large Mask) inpainting model. Remove unwanted objects or fill in missi …☆23Nov 5, 2024Updated last year
- A system for Prompt generation to improve Text-to-Image performance.☆93Feb 7, 2026Updated 3 weeks ago
- Animefy: ComfyUI workflow designed to convert images or videos into an anime-like style automatically.☆22Jul 2, 2024Updated last year
- ☆36Oct 12, 2024Updated last year
- Controlling diffusion-based image generation with just a few strokes☆64Dec 21, 2023Updated 2 years ago
- Simple, Efficient, and Effective Negative Guidance in Few-Step Image Generation Models By Value Sign Flip☆37Jan 27, 2026Updated last month
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"☆44Aug 22, 2024Updated last year
- Official implementation of "Single Image Iterative Subject-driven Generation and Editing".☆100May 30, 2025Updated 9 months ago
- ☆64May 3, 2025Updated 10 months ago
- JAX port of FLUX.1 models using flax.nnx☆24Sep 28, 2024Updated last year
- a comfyui custom node for I2V-Adapter☆21Jul 2, 2024Updated last year
- ☆25Mar 30, 2025Updated 11 months ago
- A platform aimed at creating websites that perform self-optimization☆12May 4, 2024Updated last year
- Designed to help lawyers and legal professionals find precedent fast and prepare for case negotiations by simulating trajectories☆10Oct 16, 2024Updated last year
- ☆45Dec 1, 2025Updated 3 months ago
- ☆46Nov 20, 2025Updated 3 months ago
- Browser viewer for GaussianAvatars based on Brush☆25Dec 23, 2024Updated last year
- ☆49Nov 8, 2025Updated 3 months ago
- ☆44Jul 3, 2024Updated last year
- [ECCV 2024] PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance☆23Jul 25, 2024Updated last year
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆22Jul 21, 2024Updated last year
- Uncertainty-Aware Rotation Estimation in Manhattan Environments using only monocular cues.☆77Jan 16, 2026Updated last month
- No code solution for training tabular models☆34Jan 25, 2026Updated last month
- Python library for talking to Apollo API☆10Jan 31, 2024Updated 2 years ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Feb 27, 2025Updated last year
- ☆18Nov 20, 2024Updated last year