Liac-li / MM-self-improve-qwen2vlView external linksLinks
☆13Dec 9, 2024Updated last year
Alternatives and similar repositories for MM-self-improve-qwen2vl
Users that are interested in MM-self-improve-qwen2vl are comparing it to the libraries listed below
Sorting:
- A Self-Training Framework for Vision-Language Reasoning☆88Jan 23, 2025Updated last year
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- An implementation of Scalable Evaluation and Improvement of Document Set Expansion via Neural Positive-Unlabeled Learning without AllenNL…☆19Feb 20, 2024Updated last year
- text classification compitioin top 10 strategy☆18Aug 14, 2021Updated 4 years ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆57Jun 1, 2025Updated 8 months ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆42Nov 11, 2025Updated 3 months ago
- ☆19Jun 4, 2020Updated 5 years ago
- The official implementation of the ECCV'24 paper MC-CoT: Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models w…☆26May 19, 2024Updated last year
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models☆28Nov 25, 2024Updated last year
- ☆21Feb 18, 2022Updated 3 years ago
- 2021MXAP-DGL rank2☆35Mar 23, 2022Updated 3 years ago
- Partially Non-Autoregressive Image Captioning☆10Sep 30, 2021Updated 4 years ago
- the datasets of our paper☆11Feb 26, 2024Updated last year
- Official repository for "LFR-GAN: Local Feature Refinement based Generative Adversarial Network for Text-to-Image Generation" (TOMM 2023)…☆11Mar 21, 2023Updated 2 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆24Jul 21, 2025Updated 6 months ago
- Official Repo for "Why Settle for One? Text-to-ImageSet Generation and Evaluation"☆21Oct 1, 2025Updated 4 months ago
- The implement of LLMTreeRec☆14Dec 9, 2024Updated last year
- ☆17Dec 23, 2025Updated last month
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated 11 months ago
- ☆10Jul 24, 2018Updated 7 years ago
- ☆10May 8, 2024Updated last year
- ☆12Nov 2, 2024Updated last year
- ☆10Jan 28, 2024Updated 2 years ago
- End-to-end implementation of the Social Graph Network (SGN), described in the Structural Reasoning for Image-based Social Relation Recogn…☆13Apr 3, 2024Updated last year
- ☆15Jul 22, 2024Updated last year
- A collection for math word problem (MWP) works, including datasets, algorithms and so on.☆47Jun 18, 2024Updated last year
- Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023☆11Sep 21, 2023Updated 2 years ago
- ☆11Oct 2, 2023Updated 2 years ago
- Code for Research Project TLDR☆25Jul 28, 2025Updated 6 months ago
- 🤔 When in Doubt: Improving Classification Performance with Alternating Normalization [Findings of EMNLP2021]☆14Oct 29, 2021Updated 4 years ago
- ☆15Jun 4, 2024Updated last year
- ☆16Oct 11, 2025Updated 4 months ago
- A list of Numerical Multimodal reasoning papers and their implementation☆11May 13, 2024Updated last year
- A pipeline for the automatic construction of geometry problems along with step-by-step solutions.☆16Aug 27, 2025Updated 5 months ago
- Joint Multi-label Attention Network (JMAN)☆12Sep 17, 2020Updated 5 years ago
- ☆10Jan 7, 2022Updated 4 years ago
- Column Networks for Collective Classification: A novel deep learning model for collective classification in multi-relational domains☆12Nov 22, 2016Updated 9 years ago
- 收集LUG@NJU群的精华消息,好玩就行。☆12Jun 22, 2022Updated 3 years ago
- AAAI2024 Global Competition on Math Problem Solving and Reasoning☆14Oct 4, 2023Updated 2 years ago