naver-ai / elvaView external linksLinks
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning, EMNLP 2024
β19Dec 16, 2024Updated last year
Alternatives and similar repositories for elva
Users that are interested in elva are comparing it to the libraries listed below
Sorting:
- π Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3"β13Mar 26, 2024Updated last year
- β11Oct 2, 2024Updated last year
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Dataβ14Sep 30, 2023Updated 2 years ago
- [EMNLP 2023] Official repository for Dialogue Chain-of-Thought Distillation (DONUT & DOCTOR)β11Nov 15, 2023Updated 2 years ago
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"β40Dec 13, 2024Updated last year
- [NeurIPS 2025] Reasoning Models Better Express Their Confidence"β22Nov 19, 2025Updated 2 months ago
- Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining, WACV 2024β14Jan 3, 2024Updated 2 years ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our focβ¦β32Jun 13, 2024Updated last year
- ProbVLM: Probabilistic Adapter for Frozen Vision-Language Modelsβ45Dec 21, 2023Updated 2 years ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)β22Nov 1, 2023Updated 2 years ago
- About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)β15Jul 9, 2023Updated 2 years ago
- Collection of PhD Advice Linksβ20Oct 14, 2022Updated 3 years ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utβ¦β23Dec 4, 2024Updated last year
- β23Aug 26, 2023Updated 2 years ago
- [ECCVβ24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"β21Mar 26, 2025Updated 10 months ago
- Interview-based evaluation of LLMsβ24Jan 8, 2025Updated last year
- Official Implementation of SCOB [ICCV 2023]β23Nov 16, 2023Updated 2 years ago
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"β23Oct 11, 2024Updated last year
- β23Jul 8, 2023Updated 2 years ago
- πΈ Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"β22Sep 5, 2023Updated 2 years ago
- Official PyTorch implementation of Extract Free Dense Misalignment from CLIP (AAAI'25)β25Apr 20, 2025Updated 9 months ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalizationβ29Sep 12, 2024Updated last year
- Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023β109Oct 24, 2023Updated 2 years ago
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insightsβ28Oct 28, 2024Updated last year
- ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMsβ28Aug 15, 2025Updated 6 months ago
- Training code for CLIP-FlanT5β30Jul 29, 2024Updated last year
- An Enhanced CLIP Framework for Learning with Synthetic Captionsβ39Apr 18, 2025Updated 9 months ago
- ALIGN trained on COYO-datasetβ29Apr 30, 2024Updated last year
- The official implementation of γMLLMs-Augmented Visual-Language Representation Learningγβ31Mar 12, 2024Updated last year
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignmentβ29Sep 27, 2024Updated last year
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.β11Aug 30, 2024Updated last year
- Bias-to-Text: Debiasing Unknown Visual Biases through Language Interpretationβ32May 21, 2023Updated 2 years ago
- [ICML 2024] SimPro: A Simple Probabilistic Framework Towards Realistic Long-Tailed Semi-Supervised Learningβ32Sep 30, 2024Updated last year
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"β33Oct 12, 2024Updated last year
- Concurrency libraryβ16Oct 13, 2024Updated last year
- β11Dec 23, 2024Updated last year
- β13Dec 16, 2022Updated 3 years ago
- Scalable DBSCAN and OPTICS for clustering high-dimensional datasets using random projectionsβ13Nov 1, 2024Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".β12Oct 14, 2024Updated last year