robgon-art / open-clip
Test out OpenCLIP for Image Search and Automatic Captioning
☆22 · Updated last year
Alternatives and similar repositories for open-clip:
Users who are interested in open-clip are comparing it to the libraries listed below.
- Library for converting from RGB / GrayScale image to base64 and back. ☆19 · Updated 2 years ago
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts" ☆28 · Updated 3 months ago
- Load any CLIP model with a standardized interface ☆21 · Updated 8 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆34 · Updated last year
- Supporting code for: Video Enriched Retrieval Augmented Generation Using Aligned Video Captions ☆18 · Updated 5 months ago
- Fine-tuning OpenAI CLIP Model for Image Search on medical images ☆75 · Updated 2 years ago
- A component that allows you to annotate an image with points and boxes. ☆18 · Updated last year
- ViT trained on COYO-Labeled-300M dataset ☆30 · Updated 2 years ago
- ☆13 · Updated 2 years ago
- CHARacter-awaRE Diffusion: Multilingual Character-Aware Encoders for Font-Aware Diffusers That Can Actually Spell ☆14 · Updated last year
- Simple and easy stable diffusion inference with LightningModule on GPU, CPU and MPS (Possibly all devices supported by Lightning). ☆17 · Updated last year
- Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, … ☆67 · Updated last year
- Describe the format of image/text datasets ☆11 · Updated 2 years ago
- Adam with minor modifications which give significant improvement ☆19 · Updated 3 years ago
- ☆0 · Updated last year
- Unofficial TensorFlow-Keras implementation of Fastformer based on paper [Fastformer: Additive Attention Can Be All You Need](https://arxi… ☆13 · Updated 3 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 · Updated this week
- ☆43 · Updated 8 months ago
- Download flickr8k, flickr30k image caption datasets ☆13 · Updated 11 months ago
- Vision transformer finetuning scripts ☆22 · Updated last year
- LoRA fine-tuned Stable Diffusion Deployment ☆31 · Updated last year
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning. ☆12 · Updated last year
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP. ☆15 · Updated 3 years ago
- Image Search Engine with HuggingFace Sentence Transformer ☆12 · Updated last year
- MetaCLIP module for use with Autodistill. ☆21 · Updated last year
- Repo from the "Learning with limited labeled data" seminar @ Uni of Tuebingen. A collection of notes, notebooks and slideshows to underst… ☆17 · Updated last year
- Includes additional materials for the following keras.io blog post. ☆12 · Updated 3 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 · Updated 5 months ago