MLLM-DataEngine: An Iterative Refinement Approach for MLLM
☆48May 24, 2024Updated last year
Alternatives and similar repositories for MLLM-DataEngine
Users that are interested in MLLM-DataEngine are comparing it to the libraries listed below
Sorting:
- AAAI 2024: Visual Instruction Generation and Correction☆96Feb 4, 2024Updated 2 years ago
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆13Apr 15, 2024Updated last year
- A pytorch implementation of "Robust Facial Landmark Detection by Multi-order Multi-constrained Network"☆13Dec 9, 2020Updated 5 years ago
- A straightforward implementation of EGBM-based Generalized Additive Model☆14Oct 15, 2020Updated 5 years ago
- Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)☆46May 29, 2024Updated last year
- [CVPR 2024] LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge☆153Sep 3, 2025Updated 5 months ago
- ☆16Oct 5, 2023Updated 2 years ago
- Mcity Data Engine☆21Feb 4, 2026Updated 3 weeks ago
- ☆14May 26, 2023Updated 2 years ago
- [ICLR2025] LLaVA-HR: High-Resolution Large Language-Vision Assistant☆246Aug 14, 2024Updated last year
- LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos. (CVPR 2025))☆56Jun 9, 2025Updated 8 months ago
- CVPR2023☆18Mar 18, 2023Updated 2 years ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆66Sep 6, 2024Updated last year
- Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model☆281Jun 25, 2024Updated last year
- ☆64Apr 9, 2024Updated last year
- Lifelong Learning via Progressive Distillation and Retrospection☆14Apr 2, 2019Updated 6 years ago
- 西安电子科技大学视觉开源☆14Sep 2, 2018Updated 7 years ago
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning☆296Mar 13, 2024Updated last year
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …☆505Aug 9, 2024Updated last year
- Detectron2 Toolbox and Benchmark for V3Det☆18Jun 2, 2024Updated last year
- ☆28Aug 13, 2025Updated 6 months ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆19Oct 4, 2022Updated 3 years ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆22Dec 4, 2024Updated last year
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated last year
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆80Jun 17, 2024Updated last year
- 【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"☆22Sep 26, 2024Updated last year
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆46Jul 17, 2025Updated 7 months ago
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆23Oct 15, 2024Updated last year
- DELT: Data Efficacy for Language Model Training☆43Feb 12, 2026Updated 2 weeks ago
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆396Aug 24, 2024Updated last year
- A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring E…☆342Nov 6, 2025Updated 3 months ago
- ☆27Mar 21, 2024Updated last year
- 😎 curated list of awesome LMM hallucinations papers, methods & resources.☆150Mar 23, 2024Updated last year
- Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".☆58Jun 27, 2023Updated 2 years ago
- Official repository for the A-OKVQA dataset☆110May 8, 2024Updated last year
- Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).☆159Sep 27, 2024Updated last year
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆69May 31, 2024Updated last year
- ☆28Oct 19, 2021Updated 4 years ago
- CLEAR benchmark (NeurIPS 2021 Dataset & Benchmark)☆28Apr 23, 2023Updated 2 years ago