edgarGracia / gradio_image_annotatorLinks
A Gradio component that can be used to annotate images with bounding boxes.
☆66Updated this week
Alternatives and similar repositories for gradio_image_annotator
Users that are interested in gradio_image_annotator are comparing it to the libraries listed below
Sorting:
- Image Prompter for Gradio☆92Updated 2 years ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated last year
- Data release for the ImageInWords (IIW) paper.☆224Updated last year
- Diffusers training with mmengine☆102Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆70Updated last year
- A component that allows you to annotate an image with points and boxes.☆21Updated 2 years ago
- ImageSlider custom component for gradio.☆43Updated last year
- VimTS: A Unified Video and Image Text Spotter☆79Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Updated last year
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆134Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆69Updated last year
- High-throughput tensor loading for PyTorch☆221Updated last week
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆86Updated 2 years ago
- ☆69Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆132Updated last year
- Modern Stable Diffusion models family - Fluently☆32Updated last year
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"☆43Updated last year
- Open-Sora: Democratizing Efficient Video Production for All☆19Updated last year
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆131Updated 2 months ago
- Build your own Face App with Stable Diffusion 2.1☆154Updated last year
- Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance☆75Updated 7 months ago
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆280Updated last year
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆136Updated last year
- faster parallel inference of mochi-1 video generation model☆126Updated 11 months ago
- ☆27Updated last year
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆179Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆66Updated last year
- ☆86Updated last year
- Fine-tune of Florence-2 for shot categorization.☆26Updated 10 months ago
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)☆52Updated 2 weeks ago