Asynchronousx / CLIPCap-XAI
A simple, explainable vision language model for detecting manufacturing defects in products
☆14 Updated 4 months ago
Alternatives and similar repositories for CLIPCap-XAI
Users that are interested in CLIPCap-XAI are comparing it to the libraries listed below
- Inference and fine-tuning examples for vision models from 🤗 Transformers ☆165 Updated 5 months ago
- This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face … ☆33 Updated 5 months ago
- Let's bake an image. ☆16 Updated last month
- Official code for PEEKABOO2: Adapting Peekaboo with Segment Anything Model for Unsupervised Object Localization in Images and Videos. ☆29 Updated last month
- ☆56 Updated last year
- Solving Computer Vision with AI agents ☆35 Updated 6 months ago
- vision language models finetuning notebooks & use cases (Medgemma - paligemma - florence .....) ☆61 Updated 3 months ago
- From scratch implementation of a vision language model in pure PyTorch ☆254 Updated last year
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆279 Updated 6 months ago
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer vis… ☆14 Updated last year
- An AI framework for clinical diagnosis of 3D biomedical scans ☆105 Updated last year
- ☆59 Updated 3 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ☆97 Updated last week
- Vision Transformers for image classification, image segmentation, and object detection. ☆63 Updated 3 months ago
- Simple and unified interface to zero-shot computer vision models curated for robotics use cases. ☆167 Updated 3 months ago
- Practical Python exercises on classical computer vision and clean engineering practices ☆25 Updated 9 months ago
- A repository containing general tutorials I'd like to share with the world. ☆81 Updated last month
- PyTorch implementation of DINO (Self-Distillation with No Labels) from scratch. ☆18 Updated 8 months ago
- Fine tune Gemma 3 on an object detection task ☆96 Updated 6 months ago
- Diabetic Retinopathy Detection: Utilizing Multiprocessing for Processing Large Datasets and Transfer Learning to Fine-Tune Deep Learning … ☆16 Updated 8 months ago
- ☆26 Updated last year
- Bio-Medical EXpert LMM with English and Arabic Language Capabilities ☆73 Updated 3 months ago
- A Python package housing a collection of deep-learning multi-modal data fusion method pipelines! From data loading, to training, to evalu… ☆197 Updated 6 months ago
- The SCIN dataset contains 10,000+ images of dermatology conditions, crowdsourced with informed consent from US internet users. Contributi… ☆153 Updated last year
- T-JEPA official repository ☆20 Updated last year
- An integration of Segment Anything Model, Molmo, and Whisper to segment objects using voice and natural language. ☆30 Updated 11 months ago
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL… ☆21 Updated last year
- ☆121 Updated 3 weeks ago
- Ultralytics Notebooks ☆185 Updated 2 weeks ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆141 Updated last month