☆30Oct 13, 2022Updated 3 years ago
Alternatives and similar repositories for REQA
Users that are interested in REQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codes for Three-stream Interaction Decoder Network for RGB-Thermal Salient Object Detection☆30May 12, 2022Updated 4 years ago
- accepted by ieee sensors journal☆36Aug 30, 2020Updated 5 years ago
- The codes for 'Progressive cross-primitive consistency for open-world compositional zero-shot learning'☆34Mar 21, 2024Updated 2 years ago
- ☆38Dec 14, 2021Updated 4 years ago
- The codes for 'Non-Exemplar Online Class-incremental Continual Learning via Dual-prototype Self-augment and Refinement'☆33Mar 21, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆35Jun 25, 2022Updated 3 years ago
- ☆47Mar 21, 2024Updated 2 years ago
- The source codes and results of Efficient Wavelet Boost Learning-Based Multi-stage Progressive Refinement Network for Underwater Image En…☆41May 24, 2022Updated 4 years ago
- https://arxiv.org/abs/2408.02032☆136Jan 16, 2025Updated last year
- [CVPR 2023] Diversity-Aware Meta Visual Prompting☆84Nov 30, 2023Updated 2 years ago
- [EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models"☆158Jul 7, 2025Updated 10 months ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆104Nov 21, 2024Updated last year
- ☆130Dec 9, 2024Updated last year
- [EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning☆37Dec 26, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation☆45Jul 10, 2023Updated 2 years ago
- Get/modify variable's value in another Linux running process☆10May 1, 2026Updated 3 weeks ago
- Foundation models based medical image analysis☆227May 7, 2026Updated 3 weeks ago
- [ECCV 2024] FairDomain: Achieving Fairness in Cross-Domain Medical Image Segmentation and Classification☆40Apr 15, 2026Updated last month
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…☆337Oct 14, 2025Updated 7 months ago
- ☆13May 7, 2025Updated last year
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆410Aug 24, 2024Updated last year
- Docker container☆245Aug 8, 2019Updated 6 years ago
- Collection of AWESOME vision-language models for vision tasks☆3,123Oct 14, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…☆73Feb 9, 2026Updated 3 months ago
- ☆1,136Jun 27, 2024Updated last year
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆209Jul 17, 2025Updated 10 months ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs☆25May 7, 2025Updated last year
- A very powerful and easy-to-use number precision calculation and formatting library.☆347Apr 3, 2026Updated last month
- A summarization of zero-shot image recognition methods, in the perspective of element-wise representation and reasoning , covering public…☆21Oct 12, 2024Updated last year
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,325May 4, 2024Updated 2 years ago
- X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)☆506Nov 25, 2022Updated 3 years ago
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning☆297Mar 13, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 纯前端、浏览器可用、超轻量的基于对象存储的图床管理工具,支持腾讯云COS阿里云OSS七牛云Kodo等兼容亚马逊S3接口的各大平台。 | A pure front-end, browser-compatible, ultra-lightweight image hosting…☆30Jan 25, 2025Updated last year
- 🚀 一款简单高效的文件批量重命名工具,支持Windows/macOS/Linux系统☆37Feb 6, 2025Updated last year
- [CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing☆718Nov 28, 2024Updated last year
- Strong and Open Vision Language Assistant for Mobile Devices☆1,354Apr 15, 2024Updated 2 years ago
- 🔑Portable open-source KMS Emulator in C☆1,142Jan 6, 2024Updated 2 years ago
- 2025年,最新全面整理linux入门资料,构建环境搭建,shell命令,posix编程代码案例,内核项目☆74Jan 15, 2025Updated last year
- DuAT: Dual-Aggregation Transformer Network for Medical Image Segmentation (PRCV)☆82May 7, 2024Updated 2 years ago