mahmoudnafifi / WDYS
PyTorch implementation of the paper: "What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Vision-Language Models." This implementation is unofficial and provided for research and experimental purposes.
☆10 Updated 6 months ago
Alternatives and similar repositories for WDYS
Users who are interested in WDYS compare it to the repositories listed below
- (AAAI 2025) Official PyTorch implementation of paper "SAUGE: Taming SAM for Uncertainty-Aligned Multi-Granularity Edge Detection". ☆21 Updated 4 months ago
- ☆10 Updated 10 months ago
- [WACV2025] source code of StrDA: https://arxiv.org/abs/2410.09913 ☆11 Updated 5 months ago
- Deep learning sitting posture detection based on multimodal datasets (a deep-learning-based multimodal sitting posture detection system) ☆17 Updated 9 months ago
- UFPR-VCR: a dataset for vehicle color recognition that includes 10,039 images of vehicles in a wide range of real-world conditions, such … ☆10 Updated 11 months ago
- ☆14 Updated 11 months ago
- ☆12 Updated 2 months ago
- Official implementation for P2SAM (ACM MM 2024) ☆13 Updated 9 months ago
- Code Release for "MaskTerial: A Foundation Model for Automated 2D Material Flake Detection" ☆13 Updated 2 months ago
- 2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models (WWW 2025) ☆10 Updated 5 months ago
- This project is mainly the assessment project for the 2025 Zhejiang University School of Software summer camp (AI camp) ☆11 Updated 6 months ago
- Robust End-to-end Point-Supervised Tiny Object Detection ☆11 Updated last month
- ☆12 Updated 3 months ago
- Context-Informed Machine Translation of Manga using Multimodal Large Language Models ☆11 Updated 9 months ago
- The official implementation of InfoRM [NeurIPS 2024]. ☆11 Updated 5 months ago
- [ICCV2025] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectors ☆16 Updated 2 months ago
- A novel spatial multi-modal omics framework, named PRototype-Aware Graph Adaptative aggregation (PRAGA) for spatial multi-modal omics ana… ☆13 Updated 3 months ago
- (CVPR 2025 Highlight) Official repository of paper "AODRaw: Towards RAW Object Detection in Diverse Conditions" (https://arxiv.org/pdf/24… ☆19 Updated 5 months ago
- [ICASSP 2025] Self-Prompting Polyp Segmentation in Colonoscopy Using Hybrid YOLO-SAM 2 Model ☆70 Updated 10 months ago
- Frontiers in Intelligent Colonoscopy [ColonSurvey | ColonINST | ColonGPT] ☆88 Updated 4 months ago
- Various video readers for PyTorch models training and a benchmark ☆11 Updated 3 weeks ago
- [ICCV 2025 Oral] CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation ☆23 Updated last month
- CVPR2024 ☆88 Updated 6 months ago
- ☆20 Updated 8 months ago
- A short course of visual modeling ☆16 Updated last year
- Try to use the SAM-ViT as the backbone to create the learnable prompt for semantic segmentation ☆97 Updated 2 years ago
- Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024) ☆14 Updated 10 months ago
- ☆12 Updated 9 months ago
- The official code for “Recurrent Generic Contour-based Instance Segmentation with Progressive Learning”, TCSVT, 2024. ☆77 Updated 3 months ago
- [IEEE TII 2025] Official Implementation for "VarAD: Lightweight High-Resolution Image Anomaly Detection via Visual Autoregressive Modelin… ☆23 Updated 5 months ago