beccabai / Data-centric_multimodal_LLM
Survey on Data-centric Large Language Models
☆83Updated 9 months ago
Alternatives and similar repositories for Data-centric_multimodal_LLM:
Users who are interested in Data-centric_multimodal_LLM compare it to the repositories listed below
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆67Updated 2 months ago
- [ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding☆75Updated 3 weeks ago
- [EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…☆92Updated 5 months ago
- A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.☆56Updated last month
- ☆93Updated last week
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆138Updated 2 months ago
- ☆91Updated 2 weeks ago
- ☆48Updated 4 months ago
- ✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?☆109Updated last month
- SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆92Updated this week
- ☆90Updated 3 months ago
- ☆73Updated 3 months ago
- ☆72Updated 10 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models☆117Updated last week
- A Self-Training Framework for Vision-Language Reasoning☆76Updated 3 months ago
- Code release for VTW (AAAI 2025 Oral)☆34Updated 3 months ago
- An RLHF Infrastructure for Vision-Language Models☆171Updated 5 months ago
- 📖 This is a repository for organizing papers, code, and other resources related to unified multimodal models.☆173Updated 2 weeks ago
- Paper List of Inference/Test Time Scaling/Computing☆195Updated this week
- Code for "Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More"☆36Updated 3 weeks ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆100Updated last month
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆58Updated 4 months ago
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency☆100Updated 3 weeks ago
- [CVPR' 25] Interleaved-Modal Chain-of-Thought☆24Updated last month
- This repository is continuously updated with the latest papers, technical reports, and benchmarks on multimodal reasoning!☆35Updated last month
- ☆54Updated last month
- Official repository of MMDU dataset☆89Updated 6 months ago
- ☆113Updated 2 months ago
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆114Updated 2 weeks ago
- A paper list on LLMs and multimodal LLMs☆35Updated this week