huofushuo / TIDNet
Codes for Three-stream Interaction Decoder Network for RGB-Thermal Salient Object Detection
☆15Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for TIDNet
- The codes for 'Progressive cross-primitive consistency for open-world compositional zero-shot learning'☆15Updated 7 months ago
- ☆15Updated 2 years ago
- Welcome to my homepage☆7Updated 2 months ago
- The codes for 'Non-Exemplar Online Class-incremental Continual Learning via Dual-prototype Self-augment and Refinement'☆17Updated 7 months ago
- ☆18Updated 7 months ago
- ☆20Updated 2 years ago
- ☆21Updated 2 years ago
- The source codes and results of Efficient Wavelet Boost Learning-Based Multi-stage Progressive Refinement Network for Underwater Image En…☆24Updated 2 years ago
- https://arxiv.org/abs/2408.02032☆58Updated 2 weeks ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆55Updated 2 months ago
- Foundation models based medical image analysis☆52Updated this week
- [CVPR 2023] Diversity-Aware Meta Visual Prompting☆77Updated 11 months ago
- [EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models"☆62Updated last month
- ⭐️⭐️⭐️⭐️⭐️Spring 插件化开发框架,轻、快、易、稳 Spring plugin development framework, Light, Fast, Easy, and Stable⭐️⭐️⭐️⭐️⭐️☆10Updated 2 months ago
- Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR …☆251Updated last year
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆286Updated 2 months ago
- ☆14Updated this week
- Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)☆139Updated last month
- A physics-guided hierarchical deep network (PhyRes-LSTM) framework, which integrates external knowledge with deep neural networks to guid…☆16Updated 2 months ago
- ☆348Updated 3 months ago
- The official GitHub page for the survey paper "Towards Next-Generation LLM-based Recommender Systems: A Survey and Beyond". And this pape…☆109Updated this week
- This repository is the official implementation of "DTL: Disentangled Transfer Learning for Visual Recognition", which is accepted by AAAI…☆75Updated 9 months ago
- 📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).☆445Updated last month
- [ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation☆39Updated last year
- Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"☆16Updated 6 months ago
- ☆76Updated 2 months ago
- A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.☆274Updated 3 weeks ago
- [CVPR 2024] Official PyTorch Code for "PromptKD: Unsupervised Prompt Distillation for Vision-Language Models"☆230Updated last month
- A Unified Parameter-Efficient Transfer Learning Benchmark for Computer Vision Tasks☆260Updated 2 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆90Updated 2 months ago