edgarGracia / gradio_image_annotator
A Gradio component that can be used to annotate images with bounding boxes.
☆63Updated 2 months ago
Alternatives and similar repositories for gradio_image_annotator
Users who are interested in gradio_image_annotator are comparing it to the libraries listed below.
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 11 months ago
- Image Prompter for Gradio☆91Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆69Updated last year
- Data release for the ImageInWords (IIW) paper.☆220Updated 11 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year
- ImageSlider custom component for gradio.☆42Updated last year
- VimTS: A Unified Video and Image Text Spotter☆78Updated 11 months ago
- Diffusers training with mmengine☆101Updated last year
- faster parallel inference of mochi-1 video generation model☆125Updated 8 months ago
- A component that allows you to annotate an image with points and boxes.☆21Updated last year
- ☆70Updated last year
- Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing"☆85Updated last year
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆132Updated last year
- ☆86Updated last year
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆124Updated 3 months ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training)☆140Updated 9 months ago
- Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance☆75Updated 4 months ago
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling"☆42Updated last year
- Making Flux go brrr on GPUs.☆150Updated 3 months ago
- ☆26Updated last year
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL.☆177Updated last year
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆277Updated last year
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions☆129Updated last year
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆157Updated 2 years ago
- The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting☆120Updated 9 months ago
- ☆59Updated last year
- ☆206Updated last year
- Official repository for VQDM: Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization paper☆34Updated last year
- Build your own Face App with Stable Diffusion 2.1☆152Updated 9 months ago
- Scripts to teach Flux the task of image editing from language with the Flux Control framework.☆94Updated 3 months ago