naver-ai / elva
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, EMNLP 2024
☆16 Updated 5 months ago
Alternatives and similar repositories for elva
Users that are interested in elva are comparing it to the libraries listed below
- ☆38 Updated 11 months ago
- Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25) ☆21 Updated 3 weeks ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase… ☆12 Updated 10 months ago
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023 ☆103 Updated last year
- ☆24 Updated last year
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023 ☆45 Updated 11 months ago
- Google's Conceptual Captions Dataset translated into Korean ☆22 Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models ☆15 Updated 2 years ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat… ☆44 Updated 8 months ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specific… ☆69 Updated 8 months ago
- ☆46 Updated last year
- [AAAI2024] BOK-VQA: Bilingual Outside Knowledge-based Visual Question Answering via Graph Representation Pretraining ☆1 Updated 10 months ago
- ☆38 Updated last year
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search" ☆23 Updated 7 months ago
- ☆29 Updated last year
- [Under Review] Official PyTorch implementation code for realizing the technical part of Phantom of Latent representing equipped with enla… ☆58 Updated 7 months ago
- ☆24 Updated last year
- [ICLR 2023] RC-MAE ☆52 Updated last year
- Official implementation of "OffsetBias: Leveraging Debiased Data for Tuning Evaluators" ☆22 Updated 8 months ago
- Bilinear Attention Networks for Korean Visual Question Answering ☆23 Updated 9 months ago
- ☆15 Updated 2 months ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT ☆55 Updated 9 months ago
- [ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition ☆10 Updated 5 months ago
- Extended COCO Validation (ECCV) Caption dataset (ECCV 2022) ☆56 Updated last year
- Preference Learning for LLaVA ☆44 Updated 6 months ago
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs ☆11 Updated last month
- [ACL 2024 Findings] Official PyTorch Implementation code for realizing the technical part of CoLLaVO: Crayon Large Language and Vision mO… ☆96 Updated 10 months ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization ☆29 Updated 8 months ago
- Evaluating Multimodal Generative AI with Korean Educational Standards, NAACL 2025. ☆12 Updated 2 weeks ago
- ☆10 Updated 8 months ago