PhyscalX / gradio-image-prompter
Image Prompter for Gradio
☆92 · Updated last year
Alternatives and similar repositories for gradio-image-prompter
Users who are interested in gradio-image-prompter are comparing it to the libraries listed below.
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces ☆238 · Updated 5 months ago
- Codebase for the Recognize Anything Model (RAM) ☆82 · Updated last year
- ☆20 · Updated 2 years ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆126 · Updated last year
- A Gradio component that can be used to annotate images with bounding boxes. ☆61 · Updated last month
- Data release for the ImageInWords (IIW) paper. ☆216 · Updated 8 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions ☆128 · Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆69 · Updated last year
- Diffusers training with mmengine ☆102 · Updated last year
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model" ☆438 · Updated 4 months ago
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025) ☆211 · Updated 4 months ago
- Projects based on SigLIP (Zhai et al., 2023) and Hugging Face transformers integration 🤗 ☆264 · Updated 5 months ago
- Gradio UI for running Meta AI's Segment Anything on your own hardware. Promptable segmentation via keypoints and bounding boxes. ☆66 · Updated 2 years ago
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation ☆125 · Updated 10 months ago
- ☆188 · Updated 2 months ago
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆42 · Updated last year
- ☆235 · Updated 3 months ago
- [NeurIPS 2024] SlimSAM: 0.1% Data Makes Segment Anything Slim ☆338 · Updated 5 months ago
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models ☆278 · Updated last year
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training) ☆137 · Updated 6 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models" ☆99 · Updated last year
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting ☆120 · Updated 7 months ago
- Official Code for Tracking Any Object Amodally ☆118 · Updated last year
- 1-shot image segmentation using Stable Diffusion ☆140 · Updated last year
- Official implementation of Add-SD: Rational Generation without Manual Reference. ☆27 · Updated 11 months ago
- Image Editing Anything ☆116 · Updated 2 years ago
- [ICCV2025] Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark ☆149 · Updated 3 months ago
- ZIM: Zero-Shot Image Matting for Anything ☆330 · Updated 8 months ago
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings" ☆125 · Updated 11 months ago
- 🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction" ☆148 · Updated 3 weeks ago