naver-ai / elva
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, EMNLP 2024
☆16Updated 3 months ago
Alternatives and similar repositories for elva:
Users who are interested in elva are comparing it to the repositories listed below
- Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)☆21Updated last month
- ☆37Updated 10 months ago
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023☆104Updated last year
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Updated 2 years ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specific…☆66Updated 6 months ago
- Google's Conceptual Captions Dataset translated into Korean☆22Updated 2 years ago
- Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models, EMNLP 2023☆45Updated 9 months ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…☆12Updated 9 months ago
- ☆24Updated last year
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…☆44Updated 7 months ago
- ☆24Updated last year
- Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).☆38Updated 7 months ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆11Updated last week
- Bilinear Attention Networks for Korean Visual Question Answering☆23Updated 8 months ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Updated 5 months ago
- ☆38Updated last year
- ☆10Updated 6 months ago
- Preference Learning for LLaVA☆41Updated 4 months ago
- This is an official implementation of GRIT-VLP☆21Updated 2 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆11Updated last year
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Updated 3 months ago
- [ICLR 2023] RC-MAE☆51Updated last year
- ☆28Updated last year
- ☆46Updated 11 months ago
- [2020.07-2021.07] Repository of outstanding code from the 14th cohort of Tobigs (투빅스).☆8Updated 4 years ago
- A collection of public Korean instruction datasets for training language models.☆19Updated last year
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆11Updated 3 months ago
- [AAAI2024] BOK-VQA : Bilingual Outside Knowledge-based Visual Question Answering via Graph Representation Pretraining☆1Updated 9 months ago
- ☆59Updated last month
- ☆14Updated last month