Liac-li / MM-self-improve-qwen2vl
☆11Updated last month
Alternatives and similar repositories for MM-self-improve-qwen2vl:
Users that are interested in MM-self-improve-qwen2vl are comparing it to the libraries listed below
- ☆57Updated 7 months ago
- A Self-Training Framework for Vision-Language Reasoning☆60Updated 2 months ago
- [Preprint] A Neural-Symbolic Self-Training Framework☆101Updated 5 months ago
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)☆19Updated 2 months ago
- The official code repository for PRMBench.☆56Updated this week
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆55Updated 2 months ago
- M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning☆49Updated 3 weeks ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆160Updated 11 months ago
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆30Updated 7 months ago
- mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating☆87Updated 11 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆33Updated 9 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆76Updated 6 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆53Updated 5 months ago
- ☆16Updated last year
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆40Updated 2 months ago
- ☆44Updated 3 months ago
- ☆78Updated last year
- A RLHF Infrastructure for Vision-Language Models☆145Updated 2 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆76Updated 11 months ago
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning☆89Updated last week
- Website for MathVista☆14Updated 3 weeks ago
- my commonly-used tools☆48Updated last week
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆49Updated last month
- ☆39Updated 2 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆49Updated 3 months ago
- The reinforcement learning codes for dataset SPA-VL☆26Updated 6 months ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆35Updated 3 months ago
- MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.☆78Updated 3 months ago
- An Easy-to-use Hallucination Detection Framework for LLMs.☆55Updated 8 months ago
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models☆22Updated 4 months ago