liyongqi67 / GRACELinks

☆27

Alternatives and similar repositories for GRACE

Users that are interested in GRACE are comparing it to the libraries listed below

Sorting:

xinwei666 / MMGenerativeIR
Official Code of our AAAI-24 Paper: "Generative Multi-modal Knowledge Retrieval with Large Language Models".
☆29Updated 2 months ago
OpenMatch / UniVL-DR
[ICLR 2023] This is the code repo for our ICLR‘23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Spa…
☆53Updated last year
ZhangYiqun018 / StickerConv
☆59Updated last year
OpenMatch / MARVEL
[ACL 2024 Oral] This is the code repo for our ACL‘24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Mo…
☆39Updated last year
yhy-2000 / VideoDeepResearch
☆123Updated 2 weeks ago
open-vision-language / infoseek
☆67Updated 2 years ago
luomancs / ReMuQ
a multimodal retrieval dataset
☆24Updated 2 years ago
ZUCC-AI / UMIE
Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning
☆44Updated last year
edchengg / infoseek_eval
EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions
☆25Updated last year
TIGER-AI-Lab / UniIR
Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)
☆169Updated last year
HITsz-TMG / SKURG
☆18Updated 2 years ago
gyhdog99 / MoCLE
MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)
☆44Updated 5 months ago
LightChen233 / M3CoT
☆84Updated last year
LuminosityX / FNE
Implementation of our paper, Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination..
☆20Updated 2 years ago
AlignGPT-VL / AlignGPT
Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"
☆34Updated last year
open-vision-language / oven
☆40Updated 2 years ago
PaulLerner / ViQuAE
Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…
☆38Updated 11 months ago
JinYuanLi0012 / RiVEG
[IEEE TMM 2025 & ACL 2024 Findings] LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition
☆34Updated 4 months ago
liyongqi67 / MINDER
☆65Updated 5 months ago
haokunwen / DQU-CIR
[SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval
☆43Updated last year
ZhangYiqun018 / Multimodel-Dialog
自己阅读的多模态对话系统论文（及部分笔记）汇总
☆23Updated 2 years ago
yuezih / less-is-more
Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)
☆55Updated last year
X-PLUG / mPLUG-HalOwl
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
☆98Updated last year
BUAADreamer / SPN4CIR
[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
☆39Updated 2 months ago
ChiYeungLaw / LexLIP-ICCV23
Official Code for the ICCV23 Paper: "LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Sparse Retrieval…
☆40Updated 2 years ago
Small-Model-Gap / Small-Model-Learnability-Gap
☆18Updated last month
luka-group / mDPO
[EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.
☆83Updated last year
tianyang-x / Mixture-of-Domain-Adapters
Codebase for ACL 2023 paper "Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memori…
☆52Updated 2 years ago
Go2Heart / EchoSight
[EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge.
☆77Updated 5 months ago
Gary-code / KECVQG
[ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"
☆11Updated last year