Attention visualization in CLIP
☆17Dec 7, 2022Updated 3 years ago
Alternatives and similar repositories for CLIP-visualization
Users that are interested in CLIP-visualization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated last year
- What do CLIP Vision Transformers learn? Feature Visualization can show you!☆15Aug 29, 2024Updated last year
- Official code for "Vision Transformers with Self-Distilled Registers" (NeurIPS 2025 Spotlight)☆35Dec 6, 2025Updated 6 months ago
- This is the PyTorch implementation of paper: FSR (AAAI 2023 Oral).☆12Sep 12, 2023Updated 2 years ago
- ☆11Aug 31, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The code of Unsupervised Few-Shot Image Classification by Learning Features into Clustering Space☆15Jul 12, 2022Updated 3 years ago
- ☆12Feb 22, 2024Updated 2 years ago
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated 2 years ago
- [NeurIPS 2024] TopoFR: A Closer Look at Topology Alignment on Face Recognition☆37Oct 27, 2025Updated 7 months ago
- Image Quality Assessment Paper Reading☆15Sep 11, 2022Updated 3 years ago
- A large-scale benchmark for the evaluation of embeddings across a number of fine-grained and instance-level visual domains.☆17Jun 14, 2024Updated 2 years ago
- Code for ACM MM 2024 paper "A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning"☆19Dec 5, 2024Updated last year
- ☆18Aug 21, 2024Updated last year
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Generating Image Specific Text☆29Aug 14, 2023Updated 2 years ago
- Code of "Few-shot microscopy image cell segmentation " https://link.springer.com/chapter/10.1007/978-3-030-67670-4_9☆22Jun 17, 2025Updated 11 months ago
- [ICLR 2025] Official code for Combining Text-based and Drag-based Editing for Precise and Flexible Image Editing.☆21May 6, 2025Updated last year
- vrml-stl geometry converter☆16Nov 12, 2015Updated 10 years ago
- Face Lighting estimation using GANs☆23Oct 18, 2025Updated 7 months ago
- Implements "Depth Map Prediction from a Single Image using a Multi-Scale Deep Network" (Eigen et. al., NIPS2014)☆14May 18, 2017Updated 9 years ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Jun 8, 2026Updated last week
- Scripts for use with LongCLIP, including fine-tuning Long-CLIP☆63Mar 11, 2025Updated last year
- Create Persona dataset from reddit en movie category comment☆11Aug 6, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Plotting heatmaps with the self-attention of the [CLS] tokens in the last layer.☆50May 11, 2022Updated 4 years ago
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated last year
- ☆37Jan 25, 2024Updated 2 years ago
- This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…☆15Feb 25, 2026Updated 3 months ago
- How to use OpenAI API?☆12Nov 23, 2023Updated 2 years ago
- ☆11Nov 13, 2025Updated 7 months ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- An implementation of the FlowNetC correlation layer in tensorflow☆21Aug 17, 2018Updated 7 years ago
- ☆48Mar 15, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Make MNIST train data for YOLO.☆20Jun 29, 2021Updated 4 years ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆20Jun 2, 2025Updated last year
- ☆39May 22, 2025Updated last year
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- CGMaker with sparse 3DGS | Customized DUSt3R-to-COLMAP Converter☆10Nov 26, 2024Updated last year
- [CVPR23W] "A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion" by Haomin Zhuang, Yihua Zhang and Sijia Liu☆27Aug 27, 2024Updated last year
- Tensorflow raw implementation of paper "End-to-End Learning of Geometry and Context for Deep Stereo Regression"☆15Jul 5, 2017Updated 8 years ago