Wusiwei0410 / SciMMIRLinks
☆24Updated last year
Alternatives and similar repositories for SciMMIR
Users that are interested in SciMMIR are comparing it to the libraries listed below
Sorting:
- [ACL 2024] This is the code repo for our ACL‘24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module …☆36Updated last year
- Codebase for ACL 2023 paper "Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memori…☆49Updated last year
- ☆17Updated last year
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆135Updated 2 years ago
- Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)☆161Updated 10 months ago
- ☆40Updated 2 years ago
- Released code for our ICLR23 paper.☆65Updated 2 years ago
- Paper, dataset and code list for multimodal dialogue.☆21Updated 7 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆43Updated last month
- ☆24Updated last year
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆98Updated 7 months ago
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue (ACL 2024)☆23Updated last year
- Scaling Sentence Embeddings with Large Language Models☆111Updated last year
- This repository provides a comprehensive collection of research papers focused on multimodal representation learning, all of which have b…☆77Updated 2 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆90Updated last year
- ☆49Updated 6 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆79Updated 9 months ago
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria☆70Updated 10 months ago
- SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images (AAAI2023)☆92Updated 4 months ago
- TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning☆23Updated 11 months ago
- Code for "Small Models are Valuable Plug-ins for Large Language Models"☆131Updated 2 years ago
- Source code and data used in the papers ViQuAE (Lerner et al., SIGIR'22), Multimodal ICT (Lerner et al., ECIR'23) and Cross-modal Retriev…☆38Updated 8 months ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆44Updated last year
- ☆55Updated 7 months ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation☆47Updated last year
- mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)☆96Updated 2 years ago
- ☆80Updated last year
- ☆60Updated last year
- [Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics☆38Updated 7 months ago
- ☆39Updated 10 months ago