AkiRusProd / CLIP-search
An implementation of an image search engine using CLIP (Contrastive Language-Image Pre-Training)
☆14 Updated last year
Alternatives and similar repositories for CLIP-search
Users who are interested in CLIP-search are comparing it to the repositories listed below.
- DETRPose: Real-time end-to-end transformer model for multi-person pose estimation ☆68 Updated last month
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT ☆100 Updated last year
- Edge Weight Prediction For Category-Agnostic Pose Estimation ☆45 Updated last week
- Official Code for Tracking Any Object Amodally ☆120 Updated last year
- A simple demo for utilizing grounding dino and segment anything v2 models together ☆21 Updated last year
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX. ☆52 Updated last year
- Official repository of "FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring" ☆77 Updated 2 months ago
- A component that allows you to annotate an image with points and boxes. ☆21 Updated 2 years ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆134 Updated last year
- EdgeSAM model for use with Autodistill. ☆29 Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆69 Updated last year
- A Gradio web UI for Depth-Pro, Sharp Monocular Metric Depth Estimation ☆55 Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing ☆70 Updated last year
- ☆26 Updated last year
- This repository contains code for deploying a Gradio application using the SAM2 model for video processing. The application allows users … ☆45 Updated last year
- ☆44 Updated last year
- Unified Negative Pair Generation toward Well-discriminative Feature Space for Face Recognition ☆34 Updated 3 years ago
- Content-Based Image Retrieval (CBIR) using Faiss (Facebook) and many different feature extraction methods ( VGG16, ResNet50, Local Binary… ☆46 Updated last year
- Aging Time Lapse using Stable Diffusion ☆14 Updated last year
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network ☆111 Updated 6 months ago
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation ☆79 Updated 3 months ago
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection ☆58 Updated 2 years ago
- Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning (CVPR 2025) ☆33 Updated 8 months ago
- Real-time object detection using Florence-2 with a user-friendly GUI. ☆30 Updated 6 months ago
- Python scripts performing object detection using the YOLOv9 MIT model in ONNX. ☆40 Updated last year
- ☆31 Updated 2 years ago
- Repo for event-based binary image reconstruction. ☆33 Updated last year
- [WINNER SOLUTION] soccernet monocular depth estimation solution ☆13 Updated 5 months ago
- Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-lan… ☆150 Updated last year
- "A Deep Moving-camera Background Model" [Erez, Shapira Weber, and Freifeld, ECCV 2022] ☆46 Updated 2 years ago