edgarGracia / gradio_image_annotator
A Gradio component that can be used to annotate images with bounding boxes.
☆63 · Updated 3 weeks ago
Alternatives and similar repositories for gradio_image_annotator
Users who are interested in gradio_image_annotator are comparing it to the libraries listed below.
- Image Prompter for Gradio ☆91 · Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆52 · Updated 11 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆69 · Updated last year
- ImageSlider custom component for gradio. ☆43 · Updated last year
- Data release for the ImageInWords (IIW) paper. ☆221 · Updated 11 months ago
- Diffusers training with mmengine ☆101 · Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆67 · Updated last year
- A component that allows you to annotate an image with points and boxes. ☆21 · Updated last year
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training) ☆142 · Updated 9 months ago
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ☆85 · Updated last year
- VimTS: A Unified Video and Image Text Spotter ☆78 · Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions ☆129 · Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆64 · Updated last year
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices ☆125 · Updated 3 months ago
- Modern Stable Diffusion models family - Fluently ☆32 · Updated last year
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆134 · Updated last year
- ☆69 · Updated last year
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting ☆120 · Updated 10 months ago
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models ☆279 · Updated last year
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆157 · Updated 2 years ago
- ☆194 · Updated last year
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing ☆58 · Updated 11 months ago
- Build your own Face App with Stable Diffusion 2.1 ☆154 · Updated 10 months ago
- ☆208 · Updated last year
- ComfyUI YOLO-World Integration ☆48 · Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆37 · Updated 2 years ago
- Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance ☆75 · Updated 4 months ago
- Codebase for the Recognize Anything Model (RAM) ☆87 · Updated last year
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group ☆136 · Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 · Updated last year