CharlesCNorton / yoflo-gui
Real-time, YOLO-like object detection using the Florence-2-base-ft model with a user-friendly GUI.
☆19Updated 2 months ago
Alternatives and similar repositories for yoflo-gui:
Users that are interested in yoflo-gui are comparing it to the libraries listed below
- EdgeSAM model for use with Autodistill.☆26Updated 9 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆62Updated 7 months ago
- A simple demo for utilizing grounding dino and segment anything v2 models together☆19Updated 7 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆35Updated last year
- ☆23Updated 5 months ago
- vision language models finetuning notebooks & use cases (paligemma - florence .....)☆19Updated 5 months ago
- Code of paper "A new baseline for edge detection: Make Encoder-Decoder great again"☆37Updated last month
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆57Updated 3 weeks ago
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"☆31Updated 5 months ago
- SAM-CLIP module for use with Autodistill.☆14Updated last year
- GroundedSAM Base Model plugin for Autodistill☆49Updated 11 months ago
- 🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)☆29Updated last month
- ☆31Updated 3 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated 11 months ago
- TensorFlow implementation of a comprehensive comparison of various SSL (Semi-Supervised Learning) approaches in image segmentation, featu…☆18Updated 4 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆29Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆55Updated last year
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!☆22Updated 2 months ago
- ☆58Updated last year
- ☆56Updated 3 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆19Updated 4 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 7 months ago
- [WACV 2025] Official implementation of "Online-LoRA: Task-free Online Continual Learning via Low Rank Adaptation" by Xiwen Wei, Guihong L…☆33Updated 4 months ago
- [ECCV 2024] Official Release of SILC: Improving vision language pretraining with self-distillation☆40Updated 5 months ago
- Code for NOLA, an implementation of "nola: Compressing LoRA using Linear Combination of Random Basis"☆52Updated 6 months ago
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆15Updated 8 months ago