Stability-AI / Stable-Grounded-Segment-AnythingLinks
Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
☆11Updated last year
Alternatives and similar repositories for Stable-Grounded-Segment-Anything
Users that are interested in Stable-Grounded-Segment-Anything are comparing it to the libraries listed below
Sorting:
- ☆24Updated last year
- Modern Stable Diffusion models family - Fluently☆32Updated last year
- ☆16Updated last year
- ☆13Updated last year
- ☆31Updated last year
- My Implementation of " Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML☆26Updated last year
- lightweight LAMA inference wrapper☆25Updated last year
- ☆16Updated last year
- XGEN-MM(BLIP3) Autocaptioning Tools☆16Updated last year
- ☆14Updated 4 months ago
- ☆9Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Updated last year
- Generate images from an initial frame and text☆37Updated last year
- A fast approach for translating a series of text prompts into a video. The 2022 NeurIPS Workshop on Machine Learning for Creativity and D…☆32Updated 2 years ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Updated 8 months ago
- Code for the paper "Manipulating Embeddings of Stable Diffusion Prompts".☆14Updated 11 months ago
- ☆13Updated 2 years ago
- ☆14Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated 11 months ago
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆58Updated 7 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆18Updated 6 months ago
- ☆32Updated 10 months ago
- a naive 3d human pose editor GUI.☆19Updated 2 years ago
- ☆16Updated last year
- FlexiFilm: Long Video Generation with Flexible Conditions☆31Updated last year
- ☆13Updated last year
- ☆17Updated last year
- Controlling diffusion-based image generation with just a few strokes☆63Updated last year
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆48Updated 5 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year