okaris / grounded-segmentationLinks
A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integration of powerful object detection and segmentation models, offering an easy-to-use interface for developers seeking efficient image analysis capabilities without complex setups.
☆64Updated 10 months ago
Alternatives and similar repositories for grounded-segmentation
Users that are interested in grounded-segmentation are comparing it to the libraries listed below
Sorting:
- Community ComfyUI workflows running on fal.ai☆58Updated 11 months ago
- Gradio UI for a Cog API☆69Updated last year
- Gradio app to track objects in video and add visual effects☆17Updated 2 weeks ago
- Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation☆119Updated 6 months ago
- ☆66Updated 9 months ago
- A minimalistic, hackable code base to finetune Wan video generation model☆43Updated 3 months ago
- ☆32Updated last year
- Simple LaMa Inpainting: An easy-to-use implementation of the LaMa (Large Mask) inpainting model. Remove unwanted objects or fill in missi…☆22Updated 9 months ago
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆59Updated 7 months ago
- ☆13Updated last year
- ☆12Updated last year
- ☆11Updated last year
- ☆30Updated 9 months ago
- sd3 dreambooth lora training book, adapted from the diffusers doc☆45Updated last year
- ☆17Updated last year
- ☆16Updated last year
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!☆18Updated 2 years ago
- ☆22Updated last year
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type.☆68Updated 9 months ago
- Official Implementation of "Instance Segmentation of Scene Sketches Using Natural Image Priors" (SIGGRAPH 2025)☆46Updated 3 weeks ago
- A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images.☆60Updated 8 months ago
- ☆45Updated 8 months ago
- ☆92Updated this week
- ☆25Updated last year
- ☆17Updated last year
- ☆16Updated last year
- JAX port of FLUX.1 models using flax.nnx☆24Updated 10 months ago
- ☆24Updated last year
- ☆19Updated 11 months ago
- ☆29Updated last year