huofushuo / REQAView external linksLinks
☆27Oct 13, 2022Updated 3 years ago
Alternatives and similar repositories for REQA
Users that are interested in REQA are comparing it to the libraries listed below
Sorting:
- Codes for Three-stream Interaction Decoder Network for RGB-Thermal Salient Object Detection☆27May 12, 2022Updated 3 years ago
- accepted by ieee sensors journal☆33Aug 30, 2020Updated 5 years ago
- The codes for 'Progressive cross-primitive consistency for open-world compositional zero-shot learning'☆31Mar 21, 2024Updated last year
- The codes for 'Non-Exemplar Online Class-incremental Continual Learning via Dual-prototype Self-augment and Refinement'☆30Mar 21, 2024Updated last year
- ☆33Dec 14, 2021Updated 4 years ago
- ☆33Jun 25, 2022Updated 3 years ago
- ☆42Mar 21, 2024Updated last year
- The source codes and results of Efficient Wavelet Boost Learning-Based Multi-stage Progressive Refinement Network for Underwater Image En…☆38May 24, 2022Updated 3 years ago
- https://arxiv.org/abs/2408.02032☆131Jan 16, 2025Updated last year
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆101Nov 21, 2024Updated last year
- [EMNLP'24] Code and data for paper "Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models"☆154Jul 7, 2025Updated 7 months ago
- ☆126Dec 9, 2024Updated last year
- [EMNLP'24] MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning☆36Dec 26, 2024Updated last year
- Effinformer: A Deep-Learning-Based Data-Driven Modeling of DC–DC Bidirectional Converters (Published in: IEEE Transactions on Instrumenta…☆11May 9, 2024Updated last year
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…☆325Oct 14, 2025Updated 4 months ago
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…☆70Updated this week
- A summarization of zero-shot image recognition methods, in the perspective of element-wise representation and reasoning , covering public…☆20Oct 12, 2024Updated last year
- [arXiv 2501.13117]The Multiplex CoT makes AI more thoughtful.☆18Feb 9, 2025Updated last year
- Collection of AWESOME vision-language models for vision tasks☆3,081Oct 14, 2025Updated 4 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'☆204Jul 17, 2025Updated 6 months ago
- jyf-drawing-board是一个背景透明的Web画板项目,使用HTML5 的<canvas>元素来实现绘图功能。☆20Feb 8, 2025Updated last year
- A very powerful and easy-to-use number precision calculation and formatting library.☆357Oct 29, 2025Updated 3 months ago
- SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation (AAAI24)☆25Jul 2, 2024Updated last year
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning☆296Mar 13, 2024Updated last year
- A flexible and efficient codebase for training visually-conditioned language models (VLMs)☆922Jul 4, 2024Updated last year
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,334May 4, 2024Updated last year
- 🚀 Gone - A Lightweight Dependency Injection Framework for Go | Tag-based Auto Injection | Supports Config Center/Lifecycle Management | …☆132Dec 15, 2025Updated last month
- [ICRA2023] Implementation of Visual Language Maps for Robot Navigation☆646Jul 9, 2024Updated last year
- 🚀 一款简单高效的文件批量重命名工具,支持Windows/macOS/Linux系统☆35Feb 6, 2025Updated last year
- 纯前端、浏览器可用、超轻量的基于对象存储的图床管理工具,支持腾讯云COS阿里云OSS七牛云Kodo等兼容亚马逊S3接口的各大平台。 | A pure front-end, browser-compatible, ultra-lightweight image hosting…☆29Jan 25, 2025Updated last year
- 2025年,最新全面整理linux入门资料,构建环境搭建,shell命令,posix编程代码案例,内核项目☆63Jan 15, 2025Updated last year
- Awesome Knowledge Distillation☆3,811Dec 25, 2025Updated last month
- ☆52Sep 24, 2023Updated 2 years ago
- Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)☆99Jun 4, 2025Updated 8 months ago
- GPT4V-level open-source multi-modal model based on Llama3-8B☆2,432Mar 3, 2025Updated 11 months ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆122Dec 26, 2024Updated last year
- Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks☆3,816Updated this week
- [CVPR 2024] Prompt Highlighter: Interactive Control for Multi-Modal LLMs☆157Jul 23, 2024Updated last year
- Doge Family of Small Language Models☆184Jan 6, 2026Updated last month