danielgural / semantic_video_search
A FiftyOne Plugin that allows you to search across any modality in your videos!
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for semantic_video_search
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆18Updated 3 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆34Updated last year
- ViT trained on COYO-Labeled-300M dataset☆29Updated last year
- ☆64Updated last year
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆67Updated last year
- ☆58Updated 8 months ago
- ☆31Updated 2 years ago
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation☆39Updated 5 months ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆22Updated 10 months ago
- A dashboard for exploring timm learning rate schedulers☆18Updated last year
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning).☆16Updated last year
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆14Updated 4 months ago
- ☆30Updated this week
- ☆87Updated 10 months ago
- Python Tools for Visual Dataset Transformation☆26Updated this week
- FiftyOne Plugin for finding common image quality issues☆29Updated last month
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆97Updated last year
- Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank Adapters (LoRA).☆18Updated 3 months ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated last week
- 4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022☆42Updated last year
- ☆13Updated last year
- Effective frame sampling for ML applications.☆16Updated last week
- Aggregating embeddings over time☆31Updated last year
- Load any clip model with a standardized interface☆21Updated 6 months ago
- A non-JIT version implementation / replication of CLIP of OpenAI in pytorch☆34Updated 3 years ago
- Run zero-shot prediction models on your data☆30Updated 5 months ago
- Supercharge Your PyTorch Image Models: Bag of Tricks to 8x Faster Inference with ONNX Runtime & Optimizations☆20Updated last month
- Official implementation of "Active Image Indexing"☆58Updated last year
- ☆26Updated last month
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated last year