danielgural / semantic_video_searchLinks
A FiftyOne Plugin that allows you to search across any modality in your videos!
☆21Updated last month
Alternatives and similar repositories for semantic_video_search
Users that are interested in semantic_video_search are comparing it to the libraries listed below
Sorting:
- 4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022☆42Updated last year
- ☆75Updated 2 weeks ago
- Official implementation of "Active Image Indexing"☆59Updated 2 years ago
- FiftyOne Plugin for finding common image quality issues☆32Updated 8 months ago
- Timm model explorer☆40Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Updated last year
- Run zero-shot prediction models on your data☆32Updated 6 months ago
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation☆41Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- Estimate dataset difficulty and detect label mistakes using reconstruction error ratios!☆25Updated 6 months ago
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆18Updated last year
- Effective frame sampling for ML applications.☆20Updated 2 months ago
- ViT trained on COYO-Labeled-300M dataset☆32Updated 2 years ago
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆70Updated 2 months ago
- Run SOTA Vision-Language Model Florence-2 on your data!☆11Updated 3 months ago
- ☆58Updated last year
- An ONNX-based implementation of the CLIP model that doesn't depend on torch or torchvision.☆72Updated last year
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds☆132Updated last week
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 11 months ago
- Python Tools for Visual Dataset Transformation☆27Updated this week
- Pytorch based library to rank predicted bounding boxes using text/image user's prompts.☆51Updated 3 years ago
- State-of-the-art data augmentation search algorithms in PyTorch☆47Updated last year
- Vision-oriented multimodal AI☆49Updated last year
- Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models☆76Updated 2 months ago
- ☆33Updated 2 years ago
- Load any clip model with a standardized interface☆21Updated last year
- EdgeSAM model for use with Autodistill.☆27Updated last year
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 8 months ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆63Updated 3 months ago
- Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics inclu…☆53Updated 6 months ago