okaris / grounded-segmentation
A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integration of powerful object detection and segmentation models, offering an easy-to-use interface for developers seeking efficient image analysis capabilities without complex setups.
☆66Updated last year
Alternatives and similar repositories for grounded-segmentation
Users who are interested in grounded-segmentation are comparing it to the libraries listed below
- Community ComfyUI workflows running on fal.ai☆57Updated last year
- Gradio UI for a Cog API☆71Updated last year
- Gradio app to track objects in video and add visual effects☆17Updated 5 months ago
- Optimizing diffusion for production-ready speeds☆31Updated last week
- ☆13Updated last year
- Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation☆119Updated 11 months ago
- ☆69Updated last year
- ☆30Updated last year
- ☆171Updated 2 months ago
- Use miniGPT-4 batch to generate captions for a lot of images! You should be able to create the best captions you always wanted!☆18Updated 2 years ago
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing☆60Updated last year
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Updated last year
- ☆17Updated last year
- ☆27Updated last year
- ☆25Updated 2 years ago
- ☆95Updated 4 months ago
- A minimalistic, hackable code base to finetune Wan video generation model☆48Updated 9 months ago
- ☆23Updated last year
- ☆17Updated 2 years ago
- JAX port of FLUX.1 models using flax.nnx☆24Updated last year
- ☆11Updated last year
- ☆24Updated last year
- ☆12Updated 2 years ago
- ☆32Updated last year
- ☆46Updated last month
- Build your own Face App with Stable Diffusion 2.1☆154Updated last year
- SD3 DreamBooth LoRA training book, adapted from the diffusers doc☆48Updated last year
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆62Updated 2 years ago
- ☆16Updated last year
- [NOTE] I do not have enough resources to maintain VMS; please use Ostris's AI-Toolkit instead☆42Updated 3 months ago