☆13Dec 9, 2024Updated last year
Alternatives and similar repositories for MM-self-improve-qwen2vl
Users who are interested in MM-self-improve-qwen2vl are comparing it to the repositories listed below
- A Self-Training Framework for Vision-Language Reasoning☆88Jan 23, 2025Updated last year
- An implementation of Scalable Evaluation and Improvement of Document Set Expansion via Neural Positive-Unlabeled Learning without AllenNL…☆19Feb 20, 2024Updated 2 years ago
- Text classification competition top 10 strategy☆18Aug 14, 2021Updated 4 years ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆57Jun 1, 2025Updated 9 months ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆42Nov 11, 2025Updated 3 months ago
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models☆28Nov 25, 2024Updated last year
- ☆19Jun 4, 2020Updated 5 years ago
- The official implementation of the ECCV'24 paper MC-CoT: Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models w…☆26May 19, 2024Updated last year
- ☆21Feb 18, 2022Updated 4 years ago
- 2021MXAP-DGL rank2☆35Mar 23, 2022Updated 3 years ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆18Feb 14, 2026Updated 3 weeks ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- Official Repo for "Why Settle for One? Text-to-ImageSet Generation and Evaluation"☆21Oct 1, 2025Updated 5 months ago
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- ☆17Dec 23, 2025Updated 2 months ago
- ☆10May 8, 2024Updated last year
- End-to-end implementation of the Social Graph Network (SGN), described in the Structural Reasoning for Image-based Social Relation Recogn…☆13Apr 3, 2024Updated last year
- ☆10Jul 24, 2018Updated 7 years ago
- ☆10Jan 28, 2024Updated 2 years ago
- Can VLMs understand students' hand-drawn math work?☆16Jan 20, 2026Updated last month
- ☆15Jul 22, 2024Updated last year
- ☆12Nov 2, 2024Updated last year
- A collection for math word problem (MWP) works, including datasets, algorithms and so on.☆47Jun 18, 2024Updated last year
- Column Networks for Collective Classification: A novel deep learning model for collective classification in multi-relational domains☆12Nov 22, 2016Updated 9 years ago
- ☆16Oct 11, 2025Updated 4 months ago
- Collects highlight messages from the LUG@NJU group chat; anything fun qualifies.☆12Jun 22, 2022Updated 3 years ago
- A list of Numerical Multimodal reasoning papers and their implementation☆11May 13, 2024Updated last year
- Joint Multi-label Attention Network (JMAN)☆12Sep 17, 2020Updated 5 years ago
- ☆15Jun 4, 2024Updated last year
- Code for Research Project TLDR☆25Jul 28, 2025Updated 7 months ago
- A pipeline for the automatic construction of geometry problems along with step-by-step solutions.☆17Aug 27, 2025Updated 6 months ago
- 🤔 When in Doubt: Improving Classification Performance with Alternating Normalization [Findings of EMNLP2021]☆14Oct 29, 2021Updated 4 years ago
- ☆11Oct 2, 2023Updated 2 years ago
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023☆11Sep 21, 2023Updated 2 years ago
- ☆13Sep 9, 2020Updated 5 years ago
- The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapte…☆17Jan 15, 2024Updated 2 years ago
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated last year
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated 2 years ago