okaris / grounded-segmentationLinks
A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integration of powerful object detection and segmentation models, offering an easy-to-use interface for developers seeking efficient image analysis capabilities without complex setups.
☆66Updated last year
Alternatives and similar repositories for grounded-segmentation
Users that are interested in grounded-segmentation are comparing it to the libraries listed below
Sorting:
- Community ComfyUI workflows running on fal.ai☆57Updated last year
- Gradio UI for a Cog API☆70Updated last year
- Gradio app to track objects in video and add visual effects☆17Updated 6 months ago
- Optimizing diffusion for production-ready speeds☆34Updated last month
- Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation☆119Updated last year
- ☆69Updated last year
- ☆13Updated last year
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Updated last year
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type.☆69Updated last year
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆60Updated last year
- ☆17Updated 2 years ago
- ☆46Updated 2 months ago
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!☆18Updated 2 years ago
- ☆30Updated last year
- ☆12Updated 2 years ago
- ☆15Updated last year
- ☆25Updated 2 years ago
- ☆11Updated 2 years ago
- sd3 dreambooth lora training book, adapted from the diffusers doc☆48Updated last year
- Build your own Face App with Stable Diffusion 2.1☆154Updated last year
- ☆27Updated last year
- ☆175Updated 3 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆70Updated last year
- ☆29Updated 2 years ago
- ☆24Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated last year
- Cog wrapper for FalconsAi / nsfw_image_detection☆18Updated 6 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- ☆94Updated 5 months ago
- ☆29Updated 2 years ago