PhyscalX / gradio-image-prompter
Image Prompter for Gradio
☆92 Updated last year
Alternatives and similar repositories for gradio-image-prompter
Users who are interested in gradio-image-prompter are comparing it to the libraries listed below.
- [ICCV2023] Segment Every Reference Object in Spatial and Temporal Spaces ☆237 Updated 9 months ago
- Codebase for the Recognize Anything Model (RAM) ☆87 Updated last year
- A Gradio component that can be used to annotate images with bounding boxes. ☆63 Updated last month
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions ☆132 Updated last year
- Data release for the ImageInWords (IIW) paper. ☆223 Updated last year
- [ICCV 2025, Highlight] ZIM: Zero-Shot Image Matting for Anything ☆379 Updated 3 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆69 Updated last year
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting ☆121 Updated 11 months ago
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition" ☆109 Updated 2 weeks ago
- Pytorch Implementation of "SMITE: Segment Me In TimE" (ICLR 2025) ☆212 Updated 3 weeks ago
- ☆126 Updated last year
- [NeurIPS 2024] Official Implementation of CLIPAway ☆102 Updated 6 months ago
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation ☆128 Updated last year
- ☆180 Updated 3 weeks ago
- A Diffusion training toolbox based on diffusers and existing SOTA methods, including Dreambooth, Textual Inversion, LoRA, Custom Diffusion… ☆82 Updated last year
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ☆86 Updated last year
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training) ☆143 Updated 10 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation" ☆65 Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model ☆42 Updated last year
- Official Code for Tracking Any Object Amodally ☆120 Updated last year
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆134 Updated last year
- [CVPR 2024 Highlight] Official repo: SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing ☆53 Updated last year
- Image Editing Anything ☆116 Updated 2 years ago
- ☆195 Updated 6 months ago
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model" ☆488 Updated 8 months ago
- Diffusers training with mmengine ☆102 Updated last year
- Repo for Qwen Image Finetune ☆147 Updated last week
- Official implementation of Add-SD: Rational Generation without Manual Reference. ☆28 Updated last year
- Projects based on SigLIP (Zhai et al., 2023) and Hugging Face transformers integration 🤗 ☆291 Updated 9 months ago
- "FreeU: Free Lunch in Diffusion U-Net" for Huggingface Diffusers ☆101 Updated 2 years ago