vl-rewardbench / VL_RewardBench
☆9 Updated 3 months ago
Alternatives and similar repositories for VL_RewardBench:
Users who are interested in VL_RewardBench are comparing it to the repositories listed below.
- ☆30 Updated last week
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆65 Updated 10 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ☆65 Updated last month
- ☆8 Updated 9 months ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024 ☆22 Updated 9 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024) ☆45 Updated 5 months ago
- ☆54 Updated last year
- ☆20 Updated last month
- Official Code for ACL 2023 Outstanding Paper: World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Languag… ☆31 Updated last year
- [ICLR 2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding ☆65 Updated this week
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆47 Updated 3 months ago
- ☆11 Updated 2 months ago
- Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal … ☆46 Updated last month
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models ☆70 Updated 9 months ago
- M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆56 Updated 3 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task ☆28 Updated 2 weeks ago
- A Self-Training Framework for Vision-Language Reasoning ☆73 Updated 2 months ago
- MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale ☆35 Updated 3 months ago
- ☆48 Updated 4 months ago
- [AAAI 2025] Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning ☆30 Updated 6 months ago
- Preference Learning for LLaVA ☆41 Updated 4 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight) ☆55 Updated 5 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆56 Updated last month
- Code for "Reasoning to Learn from Latent Thoughts" ☆77 Updated this week
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective" ☆19 Updated last year
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio… ☆20 Updated 2 months ago
- Official repository for Decentralized Arena via Collective LLM Intelligence ☆9 Updated 5 months ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering ☆45 Updated 4 months ago
- ☆32 Updated 8 months ago
- ☆17 Updated last year