chenxwh / Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect , Segment and Generate Anything with Image, Text, and Audio Inputs
☆15Updated last year
Alternatives and similar repositories for Grounded-Segment-Anything:
Users that are interested in Grounded-Segment-Anything are comparing it to the libraries listed below
- Implementation of Grounding DINO & Segment Anything, and it allows masking based on prompt, which is useful for programmed inpainting.☆37Updated last year
- ☆22Updated last year
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces☆235Updated 2 weeks ago
- Cog wrapper for MagicAnimate☆30Updated last year
- Stable Fashion: A prompt based virtual try on repository☆87Updated 2 years ago
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type.☆55Updated 4 months ago
- ☆87Updated last year
- ☆29Updated last year
- ☆15Updated 8 months ago
- StoryDiffusion serverless worker☆16Updated 9 months ago
- Cog wrapper for faceswap with face enhancer☆13Updated last year
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend☆66Updated last year
- ☆29Updated last year
- A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips.☆66Updated last year
- A cog model for Ultimate SD Upscale with ControlNet tile via ComfyUI☆32Updated last year
- ☆23Updated last year
- A quality zero-shot lipsync pipeline built with MuseTalk, LivePortrait, and CodeFormer.☆34Updated 5 months ago
- [WIP] AI Try-On plugin for Chrome☆27Updated 11 months ago
- Transfer the style of your video. Use on ClarityAI.co☆69Updated 7 months ago
- ☆22Updated 3 months ago
- ☆50Updated 5 months ago
- How to Build an AI Children’s Book Service☆24Updated last year
- ☆60Updated last year
- Replicate Flux LoRA image editor.☆46Updated 6 months ago
- A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.☆121Updated last month
- ☆44Updated 3 months ago
- Community ComfyUI workflows running on fal.ai☆57Updated 6 months ago
- ☆71Updated 5 months ago
- An open source, layer-based web interface for Collage Diffusion - use a familiar Photoshop-like interface and let the AI harmonize the de…☆64Updated last year
- Gradio UI for a Cog API☆66Updated 10 months ago