robgon-art / open-clip
Test out OpenCLIP for Image Search and Automatic Captioning
☆22Updated last year
Alternatives and similar repositories for open-clip:
Users that are interested in open-clip are comparing it to the libraries listed below
- Library for converting from RGB / GrayScale image to base64 and back.☆19Updated 2 years ago
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"☆29Updated 4 months ago
- Python Tools for Visual Dataset Transformation☆26Updated 2 months ago
- A component that allows you to annotate an image with points and boxes.☆19Updated last year
- Official code repository for the WACV 2022 paper "Visualizing Paired Image Similarity in Transformer Networks"☆21Updated 2 years ago
- Describe the format of image/text datasets☆11Updated 2 years ago
- ☆13Updated 2 years ago
- Code for reproducing IS-Count: Large-scale Object Counting with Importance Sampling (AAAI 2022)☆26Updated 2 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 6 months ago
- Convert datasets from Hugging Face to FiftyOne for Visualization☆11Updated 11 months ago
- Load any clip model with a standardized interface☆21Updated 9 months ago
- Visionner turn raw image data into numpy array, more suitable for deep learning task☆10Updated last year
- Repo from the "Learning with limited labeled data" seminar @ Uni of Tuebingen. A collection of notes, notebooks and slideshows to underst…☆17Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆35Updated last year
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning).☆17Updated last year
- Adam with minor modifications which give significant improvement☆19Updated 3 years ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆17Updated 2 years ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Visual Clustering: Clustering Plotted Data by Image Segmentation☆24Updated last year
- Official Code for MIMETIC^2☆12Updated 3 months ago
- Script and models for clustering LAION-400m CLIP embeddings.☆25Updated 3 years ago
- Includes additional materials for the following keras.io blog post.☆12Updated 3 years ago
- Self-Supervised Object Detection via Generative Image Synthesis☆28Updated 3 years ago
- Facial Landmark Detection using OpenCV and Mediapipe☆11Updated 2 years ago
- Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration"☆15Updated 3 years ago
- ViT trained on COYO-Labeled-300M dataset☆31Updated 2 years ago
- arXiv 23 "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs"☆14Updated 2 months ago
- EdgeSAM model for use with Autodistill.☆26Updated 8 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆49Updated 3 weeks ago
- ☆11Updated 2 years ago