beccabai / Data-centric_multimodal_LLM
Survey on Data-centric Large Language Models
☆83Updated 10 months ago
Alternatives and similar repositories for Data-centric_multimodal_LLM
Users who are interested in Data-centric_multimodal_LLM are comparing it to the repositories listed below
- ☆95Updated last month
- [ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…☆53Updated this week
- [EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…☆93Updated 6 months ago
- A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.☆60Updated last month
- ☆47Updated 5 months ago
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio…☆20Updated 3 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆74Updated 5 months ago
- SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆107Updated 3 weeks ago
- A Self-Training Framework for Vision-Language Reasoning☆78Updated 3 months ago
- A paper list on LLMs and multimodal LLMs☆38Updated last week
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding☆78Updated last month
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆39Updated 3 weeks ago
- Paper collections of multi-modal LLM for Math/STEM/Code.☆89Updated 3 weeks ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆67Updated 3 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆73Updated 6 months ago
- ☆73Updated 11 months ago
- ✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?☆118Updated 2 months ago
- This repository is continuously updated with the latest papers, technical reports, and benchmarks on multimodal reasoning!☆38Updated last month
- OpenThinkIMG is an end-to-end open-source framework that empowers LVLMs to think with images.☆50Updated this week
- ☆43Updated last month
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆34Updated last month
- [ICLR 2025] The official PyTorch implementation of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆36Updated 5 months ago
- Code release for VTW (AAAI 2025 Oral)☆39Updated 3 months ago
- Papers about Hallucination in Multi-Modal Large Language Models (MLLMs)☆89Updated 5 months ago
- ☆117Updated 3 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆214Updated 3 weeks ago
- The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…☆79Updated 3 months ago
- ☆53Updated 6 months ago
- ☆75Updated 4 months ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigation☆94Updated last year