ZjjConan / VLM-LwEIBView external linksLinks
The official pytorch implemention of our IJCV-2025 paper "Learning with Enriched Inductive Biases for Vision-Language Models".
☆14Mar 26, 2025Updated 10 months ago
Alternatives and similar repositories for VLM-LwEIB
Users that are interested in VLM-LwEIB are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Official PyTorch Code for "DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models"☆40Jan 26, 2026Updated 2 weeks ago
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆17Sep 11, 2024Updated last year
- A Multimodal Detection and Tracking System based on DJI Payload SDK and Mobile SDK.☆17Mar 3, 2024Updated last year
- [NeurIPS 2023] Formulating Discrete Probability Flow Through Optimal Transport☆21Jan 8, 2024Updated 2 years ago
- Official PyTorch implementation for paper "ProAPO: Progressively Automatic Prompt Optimization for Visual Classification". The paper is a…☆28Nov 9, 2025Updated 3 months ago
- This repo is the official pytorch implementation of the paper: CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-V…☆40Sep 10, 2025Updated 5 months ago
- 第二届“泰迪杯”数据分析职业技能大赛A题☆10Sep 15, 2020Updated 5 years ago
- [CVPR 2025] Official implementation of BiomedCoOp☆110Jun 13, 2025Updated 8 months ago
- A vision-language model with bidirectional progressive fusion and global-local alignment for enhanced medical image segmentation.☆17Dec 25, 2025Updated last month
- [CVPR 2025] Official Pytorch Code for Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation☆46Mar 27, 2025Updated 10 months ago
- [CVPR 2022] Self-supervised Image-specific Prototype Exploration for Weakly Supervised Semantic Segmentation☆88Aug 28, 2022Updated 3 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 9 months ago
- [ICCV 2025 Highlight] Official PyTorch implementation of "SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segment…☆18Jan 18, 2026Updated 3 weeks ago
- ☆12Feb 7, 2018Updated 8 years ago
- code for LSN☆10Oct 28, 2024Updated last year
- [ACM MM2024] The code for HMLLM.☆11Oct 27, 2024Updated last year
- The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]☆49Mar 13, 2025Updated 11 months ago
- [CVPR 2022] Exploring Dual-task Correlation for Pose Guided Person Image Generation☆98Apr 11, 2024Updated last year
- Official implementation of the paper “Endowing Vision-Language Models with System 2 Thinking for Fine-Grained Visual Recognition,” AAAI 2…☆32Jan 30, 2026Updated 2 weeks ago
- This repository contains the code for the IEEE Robotics and Automation Letters paper "Open-Set Object Detection Using Classification-Free…☆14Dec 6, 2023Updated 2 years ago
- Official implementation of the paper "M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding"☆20Jan 14, 2026Updated last month
- Hypergraph Vision Transformers: Images are More than Nodes, More than Edges☆17Jul 25, 2025Updated 6 months ago
- OW-OVD: Unified Open World and Open Vocabulary Object Detection (CVPR 2025)☆23Dec 2, 2024Updated last year
- ☆13Jul 8, 2024Updated last year
- Can policy evaluation be automated? Or inevitably create hallucinated AI slop? We are trying to find out.☆55Updated this week
- [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text☆33Jan 31, 2026Updated 2 weeks ago
- IPO: Interpretable Prompt Optimization for Vision-Language Models(NeurIPS 2024)☆15Mar 4, 2025Updated 11 months ago
- [NeurIPS 2025] VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning☆25Dec 9, 2025Updated 2 months ago
- Official codebase for FACMIC: Federated Adaptative CLIP Model for Medical Image Classification (Accepted at MICCAI 2024)☆14Jun 21, 2024Updated last year
- This repository contains the **official implementation** of the paper: "VL2Lite: Task-Specific Knowledge Distillation from Large Vision-…☆16Mar 23, 2025Updated 10 months ago
- ☆11Dec 11, 2022Updated 3 years ago
- ☆12Dec 17, 2024Updated last year
- View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network (CVPR'24)☆56Mar 26, 2024Updated last year
- ITU ACM 2018 - Android Programming Study Group☆14Dec 13, 2018Updated 7 years ago
- [MICCAI 2025] FEAT:Full-Dimensional Efficient Attention Transformer for Medical Video Generation.☆21Sep 24, 2025Updated 4 months ago
- Implementation of Multi-view CNN in torch☆12May 18, 2017Updated 8 years ago
- Cross Visual Prompt Tuning [ICCV 2025]☆13Aug 3, 2025Updated 6 months ago
- ☆12Oct 30, 2024Updated last year
- Code for Language-Inspired Relation Transfer for Few-Shot Class-Incremental Learning in IEEE TPAMI☆15Apr 18, 2025Updated 9 months ago