mailcorahul / auto_labeler
auto_labeler - An all-in-one library to automatically label vision data
☆12Updated last week
Alternatives and similar repositories for auto_labeler:
Users that are interested in auto_labeler are comparing it to the libraries listed below
- Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics inclu…☆43Updated 2 weeks ago
- CoRL 2024☆368Updated 3 months ago
- EdgeSAM model for use with Autodistill.☆26Updated 7 months ago
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!☆18Updated 2 weeks ago
- RobustSAM: Segment Anything Robustly on Degraded Images (CVPR 2024 Highlight)☆332Updated 4 months ago
- Run zero-shot prediction models on your data☆30Updated last month
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆61Updated 5 months ago
- (CVPR 2024) Point, Segment and Count: A Generalized Framework for Object Counting☆105Updated 2 months ago
- Continuation of an abandoned project fast-coco-eval☆84Updated last week
- An SDK for Transformers + YOLO and other SSD family models☆58Updated this week
- Includes the code for training and testing the CountGD model from the paper CountGD: Multi-Modal Open-World Counting.☆136Updated 3 weeks ago
- [CVPR 2024] Official implementation of the paper "Visual In-context Learning"☆430Updated 9 months ago
- Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"☆340Updated 2 weeks ago
- 🤩 An AWESOME Curated List of Papers, Workshops, Datasets, and Challenges from CVPR 2024☆142Updated 7 months ago
- Quick exploration into fine tuning florence 2☆292Updated 4 months ago
- ☆47Updated 2 months ago
- The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"☆238Updated last week
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆172Updated last year
- ☆19Updated last week
- A tool for converting computer vision label formats.☆58Updated 2 weeks ago
- Code for replicating Roboflow 100 benchmark results and programmatically downloading benchmark datasets☆260Updated 3 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆52Updated last week
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆128Updated last month
- [TPAMI'24] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation☆239Updated 2 months ago
- CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks☆385Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models.☆64Updated last year
- GroundedSAM Base Model plugin for Autodistill☆47Updated 9 months ago
- Data release for the ImageInWords (IIW) paper.☆206Updated 2 months ago
- [NeurIPS 2024] Code release for "Segment Anything without Supervision"☆442Updated 3 months ago
- Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"☆143Updated this week