AkiRusProd / CLIP-search
An implementation of an image search engine using CLIP (Contrastive Language-Image Pre-Training)
☆14Updated last year
Alternatives and similar repositories for CLIP-search
Users who are interested in CLIP-search are comparing it to the libraries listed below.
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆95Updated last year
- EdgeSAM model for use with Autodistill.☆29Updated last year
- DETRPose: Real-time end-to-end transformer model for multi-person pose estimation☆58Updated last month
- Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.☆138Updated 6 months ago
- Official Code for Tracking Any Object Amodally☆120Updated last year
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆68Updated last year
- Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning (CVPR 2025)☆32Updated 6 months ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆133Updated last year
- Minimal repository to demonstrate fast LoRA inference with Flux family of models.☆25Updated 4 months ago
- ☆26Updated last year
- A Gradio component that can be used to annotate images with bounding boxes.☆64Updated last month
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆52Updated last year
- [ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning☆409Updated 2 weeks ago
- Edge Weight Prediction For Category-Agnostic Pose Estimation☆44Updated 6 months ago
- [ICCVW 2025] Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object Segmentation☆77Updated last month
- GroundedSAM Base Model plugin for Autodistill☆54Updated last year
- Official repository of "FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring"☆41Updated 2 weeks ago
- A simple demo for utilizing grounding dino and segment anything v2 models together☆20Updated last year
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆111Updated 5 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆59Updated 9 months ago
- PyTorch Implementation of "RetinaFace: Single-stage Dense Face Localisation in the Wild" | 88.90% on WiderFace Hard >> ONNX Support☆50Updated 9 months ago
- ☆44Updated 10 months ago
- Image Prompter for Gradio☆92Updated 2 years ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆58Updated this week
- Converting weights of Pytorch models to ONNX & TensorRT engines☆50Updated 2 years ago
- Code of paper "A new baseline for edge detection: Make Encoder-Decoder great again"☆40Updated 6 months ago
- A unified media (Image, Video, Audio, Text) diffusion repository, for education and learning.☆40Updated 8 months ago
- Gradio UI for running Meta AI's Segment Anything on own hardware. Promptable segmentation via keypoints and bounding boxes.☆65Updated 2 years ago
- A component that allows you to annotate an image with points and boxes.☆21Updated 2 years ago
- Official Repository of "ROSE: Remove Objects with Side Effects in Videos"☆115Updated 2 months ago