beccabai / multi-agent-data-selection
This is the repo for the paper "Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining".
☆27Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for multi-agent-data-selection
- [NeurIPS 2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models"☆89Updated last month
- The official implementation of the paper “LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models”☆107Updated 2 weeks ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆46Updated last week
- ☆67Updated 6 months ago
- ✨✨ MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?☆77Updated last month
- Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want☆60Updated 3 weeks ago
- [NeurIPS 2024] MoVA: Adapting Mixture of Vision Experts to Multimodal Context☆132Updated last month
- The official code of the paper "PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction".☆42Updated last week
- [ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs☆64Updated last week
- Official implementation of MIA-DPO☆32Updated last week
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆17Updated 3 weeks ago
- [ECCV 2024] Parrot Captions Teach CLIP to Spot Text☆60Updated 2 months ago
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆42Updated this week
- MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models☆51Updated last month
- ☆73Updated 8 months ago
- The official implementation of RAR☆72Updated 7 months ago
- [CVPR 2024] Official Code for the Paper "Compositional Chain-of-Thought Prompting for Large Multimodal Models"☆78Updated 4 months ago
- ☆103Updated 3 months ago
- The official implementation of the paper “Street-to-Satellite Image Synthesis with Diffusion Models and BEV Paradigm”☆26Updated 2 weeks ago
- [NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of…☆98Updated 3 weeks ago
- Making LLaVA Tiny via MoE-Knowledge Distillation☆55Updated 2 weeks ago
- ✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models☆137Updated this week
- [NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment☆48Updated last month
- ☆42Updated last month
- ☆23Updated 6 months ago
- Official repository of MMDU dataset☆74Updated last month
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception☆116Updated last month
- A Survey on Benchmarks of Multimodal Large Language Models☆59Updated last month
- 🔥🔥First-ever hour-scale video understanding models☆156Updated 2 weeks ago
- ☆152Updated 4 months ago