okaris / grounded-segmentation
A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integration of powerful object detection and segmentation models, offering an easy-to-use interface for developers seeking efficient image analysis capabilities without complex setups.
☆63 Updated 6 months ago
Alternatives and similar repositories for grounded-segmentation:
Users interested in grounded-segmentation are comparing it to the repositories listed below.
- Community ComfyUI workflows running on fal.ai ☆57 Updated 7 months ago
- Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation ☆117 Updated 3 months ago
- Gradio app to track objects in video and add visual effects ☆16 Updated 7 months ago
- sd3 dreambooth lora training book, adapted from the diffusers doc ☆45 Updated 10 months ago
- Gradio UI for a Cog API ☆67 Updated last year
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing ☆55 Updated 4 months ago
- ☆46 Updated 5 months ago
- ☆30 Updated 6 months ago
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type. ☆63 Updated 5 months ago
- Explore how Flux Dev responds when you change the strengths of layers in the model. ☆19 Updated 7 months ago
- ☆32 Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆50 Updated 5 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆67 Updated 11 months ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆17 Updated 7 months ago
- A unified media (Image, Video, Audio, Text) diffusion repository, for education and learning. ☆15 Updated 2 weeks ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models. ☆52 Updated last month
- ☆28 Updated 8 months ago
- ☆13 Updated last year
- A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images. ☆59 Updated 5 months ago
- ☆30 Updated 2 months ago
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted! ☆18 Updated last year
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis. ☆62 Updated 4 months ago
- ☆12 Updated 6 months ago
- ☆22 Updated 6 months ago
- ☆16 Updated last year
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (arXiv, 2024) ☆50 Updated 4 months ago
- Fine-tune of Florence-2 for shot categorization. ☆24 Updated last month
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ☆86 Updated last year
- Build your own Face App with Stable Diffusion 2.1 ☆151 Updated 3 months ago
- ☆29 Updated last year