okaris / grounded-segmentation
A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integration of powerful object detection and segmentation models, offering an easy-to-use interface for developers seeking efficient image analysis capabilities without complex setups.
☆64 Updated 8 months ago
Alternatives and similar repositories for grounded-segmentation
Users who are interested in grounded-segmentation are comparing it to the repositories listed below.
- Community ComfyUI workflows running on fal.ai ☆57 Updated 9 months ago
- Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation ☆118 Updated 5 months ago
- ☆46 Updated 7 months ago
- Gradio app to track objects in video and add visual effects ☆16 Updated last month
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type. ☆65 Updated 7 months ago
- Gradio UI for a Cog API ☆68 Updated last year
- A minimalistic, hackable code base to finetune Wan video generation model ☆40 Updated 2 months ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆17 Updated 9 months ago
- sd3 dreambooth lora training book, adapted from the diffusers doc ☆45 Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆69 Updated last year
- Collection of scripts to build small-scale datasets for fine-tuning video generation models. ☆62 Updated 3 months ago
- ☆22 Updated 8 months ago
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing ☆58 Updated 6 months ago
- ☆67 Updated 7 months ago
- Explore how Flux Dev responds when you change the strengths of layers in the model. ☆20 Updated 9 months ago
- Simple LaMa Inpainting: An easy-to-use implementation of the LaMa (Large Mask) inpainting model. Remove unwanted objects or fill in missi… ☆22 Updated 7 months ago
- ☆13 Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆52 Updated 7 months ago
- ☆30 Updated 8 months ago
- ☆12 Updated 8 months ago
- ☆32 Updated last year
- ☆34 Updated last month
- ☆28 Updated 10 months ago
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted! ☆18 Updated last year
- Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D… ☆36 Updated 4 months ago
- A gradio based image captioning tool that uses the GPT-4-Vision API to generate detailed descriptions of images. ☆59 Updated 7 months ago
- Gradio webapp to train AI Video models using Finetrainers ☆36 Updated 2 weeks ago
- Fine-tune of Florence-2 for shot categorization. ☆24 Updated 3 months ago
- ☆71 Updated 8 months ago
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (arXiv, 2024) ☆51 Updated 6 months ago