Asynchronousx / CLIPCap-XAILinks
A Simple, Explainable Vision Language Model for detecting manifacturing defects into products
☆14Updated 2 months ago
Alternatives and similar repositories for CLIPCap-XAI
Users that are interested in CLIPCap-XAI are comparing it to the libraries listed below
Sorting:
- From scratch implementation of a vision language model in pure PyTorch☆251Updated last year
- Let's bake an image.☆15Updated last week
- ☆58Updated last month
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆162Updated 3 months ago
- Bio-Medical EXpert LMM with English and Arabic Language Capabilities☆71Updated last month
- Fine tune Gemma 3 on an object detection task☆89Updated 4 months ago
- vision language models finetuning notebooks & use cases (Medgemma - paligemma - florence .....)☆56Updated last month
- An AI framework for clinical diagnosis of 3D biomedical scans☆104Updated last year
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last week
- The SCIN dataset contains 10,000+ images of dermatology conditions, crowdsourced with informed consent from US internet users. Contributi…☆149Updated last year
- Ultralytics Notebooks 🚀☆147Updated last week
- Solving Computer Vision with AI agents☆34Updated 4 months ago
- MIRIAD is a million scale Medical Instruction and RetrIeval Datatset☆129Updated last week
- This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face …☆33Updated 3 months ago
- ☆77Updated this week
- Vision Transformers for image classification, image segmentation, and object detection.☆63Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models☆274Updated 4 months ago
- ☆33Updated last month
- ☆56Updated last year
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"☆117Updated 5 months ago
- Official code for PEEKABOO2: Adapting Peekaboo with Segment Anything Model for Unsupervised Object Localization in Images and Videos.☆29Updated 3 weeks ago
- A code repository that cointains all the code for finetuning some of the popular LLMs on medical data☆66Updated last year
- Hibou: Foundational Models for Pathology☆75Updated last year
- ☆25Updated last year
- yolosegment2labelme - a Python package that allows you to convert YOLO segmentation prediction results to LabelMe and anylabeling JSON fo…☆10Updated last year
- An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.☆29Updated 9 months ago
- A tool for converting computer vision label formats.☆79Updated last week
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆33Updated last year
- Deep Learning for Computer Vision☆60Updated last year
- A collection of sophisticated computer vision and machine learning problems for graduate-level researchers and practitioners☆39Updated 5 months ago