danielgural / semantic_video_searchLinks
A FiftyOne Plugin that allows you to search across any modality in your videos!
☆21Updated 3 months ago
Alternatives and similar repositories for semantic_video_search
Users that are interested in semantic_video_search are comparing it to the libraries listed below
Sorting:
- ☆76Updated 2 months ago
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆69Updated 4 months ago
- Timm model explorer☆41Updated last year
- FiftyOne Plugin for finding common image quality issues☆33Updated 10 months ago
- Compare Savant and PyTorch performance☆13Updated last year
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!☆26Updated 8 months ago
- ☆59Updated last year
- The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"☆246Updated 7 months ago
- Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics inclu…☆54Updated 8 months ago
- Command-line tool for extracting DINO, CLIP, and SigLIP2 features for images and videos☆32Updated last month
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Updated last year
- A component that allows you to annotate an image with points and boxes.☆21Updated last year
- Supercharge Your PyTorch Image Models: Bag of Tricks to 8x Faster Inference with ONNX Runtime & Optimizations☆23Updated 11 months ago
- An ONNX-based implementation of the CLIP model that doesn't depend on torch or torchvision.☆73Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆58Updated last year
- Official implementation of "Active Image Indexing"☆59Updated 2 years ago
- Efficient parallelizable algorithms for multidimensional arrays to speed up your data pipelines☆22Updated last month
- Official Code for Tracking Any Object Amodally☆118Updated last year
- 4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022☆42Updated 2 years ago
- Run SOTA Vision-Language Model Florence-2 on your data!☆13Updated 5 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆158Updated last year
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds☆135Updated 2 weeks ago
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation☆41Updated last year
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆128Updated last year
- A tool for converting computer vision label formats.☆73Updated this week
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆67Updated last year
- ☆207Updated last year
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆18Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated last year