Asynchronousx / CLIPCap-XAILinks
A Simple, Explainable Vision Language Model for detecting manifacturing defects into products
β14Updated 4 months ago
Alternatives and similar repositories for CLIPCap-XAI
Users that are interested in CLIPCap-XAI are comparing it to the libraries listed below
Sorting:
- Inference and fine-tuning examples for vision models from π€ Transformersβ165Updated 6 months ago
- This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face β¦β33Updated 6 months ago
- β96Updated 2 months ago
- An AI framework for clinical diagnosis of 3D biomedical scansβ106Updated last year
- Actually Robust Training - Tool Inspired by Andrej Karpathy "Recipe for training neural networks". It allows you to decompose your Deepβ¦β43Updated last year
- Solving Computer Vision with AI agentsβ35Updated 7 months ago
- PyTorch implementation of DINO (Self-Distillation with No Labels) from scratch.β18Updated 8 months ago
- From scratch implementation of a vision language model in pure PyTorchβ254Updated last year
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer visβ¦β14Updated last year
- Self-Supervised Learning in PyTorchβ143Updated last year
- A code repository that cointains all the code for finetuning some of the popular LLMs on medical dataβ71Updated last year
- Bio-Medical EXpert LMM with English and Arabic Language Capabilitiesβ73Updated 3 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of modelsβ279Updated 6 months ago
- A Python package housing a collection of deep-learning multi-modal data fusion method pipelines! From data loading, to training, to evaluβ¦β198Updated 6 months ago
- Fine tune Gemma 3 on an object detection taskβ96Updated 6 months ago
- Official code for PEEKABOO2: Adapting Peekaboo with Segment Anything Model for Unsupervised Object Localization in Images and Videos.β29Updated last month
- Let's bake an image.β16Updated last month
- Vision Transformers for image classification, image segmentation, and object detection.β63Updated 3 months ago
- An XAI library that helps to explain AI models in a really quick & easy wayβ17Updated last year
- β56Updated last year
- Time Series Anomaly Detection using a Kolmogorov-Arnold Networkβ26Updated 8 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"β141Updated last month
- β255Updated 3 weeks ago
- An integration of Segment Anything Model, Molmo, and, Whisper to segment objects using voice and natural language.β30Updated 11 months ago
- Code and Documentation for the first place solution in 2023 Abdominal Trauma Detection Competition hosted by RSNA on Kaggle.β51Updated 2 years ago
- Composition of Multimodal Language Models From Scratchβ15Updated last year
- Evaluate custom and HuggingFace text-to-image/zero-shot-image-classification models like CLIP, SigLIP, DFN5B, and EVA-CLIP. Metrics incluβ¦β56Updated last year
- β114Updated last year
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.β34Updated 2 years ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.β99Updated last week