danielgural / semantic_video_search
A FiftyOne Plugin that allows you to search across any modality in your videos!
☆12Updated 10 months ago
Related projects: ⓘ
- Load any clip model with a standardized interface☆21Updated 4 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆33Updated 11 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆12Updated last month
- Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks from Google, in Pytorch☆97Updated 11 months ago
- A dashboard for exploring timm learning rate schedulers☆18Updated last year
- ☆15Updated last year
- Official Training and Inference Code of Amodal Expander, Proposed in Tracking Any Object Amodally☆14Updated 2 months ago
- Tracking through Containers and Occluders in the Wild (CVPR 2023) - Official Implementation☆39Updated 3 months ago
- ☆18Updated 3 weeks ago
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Updated 2 years ago
- ViT trained on COYO-Labeled-300M dataset☆29Updated last year
- ☆24Updated 5 months ago
- Official PyTorch implementation of RIO☆18Updated 3 years ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆17Updated 10 months ago
- ☆30Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆30Updated 5 months ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆92Updated last week
- ☆32Updated 8 months ago
- ☆17Updated last month
- This repo contains the official implementation of HAPPIER: Hierarchical Average Precision Training for Pertinent Image Retrieval (ECCV'22…☆20Updated last year
- Aggregating embeddings over time☆31Updated last year
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, …☆65Updated 11 months ago
- ☆25Updated 3 weeks ago
- Official implementation of Generative Colorization of Structured Mobile Web Pages, WACV 2023.☆21Updated 9 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆28Updated 2 months ago
- Directed masked autoencoders☆13Updated last year
- Official code repository for the WACV 2022 paper "Visualizing Paired Image Similarity in Transformer Networks"☆20Updated 2 years ago
- An open source implementation of CLIP.☆32Updated last year
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆30Updated 4 months ago