Liac-li/MM-self-improve-qwen2vl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Liac-li/MM-self-improve-qwen2vl)

Liac-li / MM-self-improve-qwen2vl

☆13

Alternatives and similar repositories for MM-self-improve-qwen2vl

Users that are interested in MM-self-improve-qwen2vl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

njucckevin / KnowCap
View on GitHub
Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
☆13Feb 15, 2024Updated 2 years ago
njucckevin / MM-Self-Improve
View on GitHub
A Self-Training Framework for Vision-Language Reasoning
☆90Jan 23, 2025Updated last year
njucckevin / CapArena
View on GitHub
An Arena-style Automated Evaluation Benchmark for Detailed Captioning
☆59Jun 1, 2025Updated last year
njucckevin / OpenMobile-Code
View on GitHub
The model, data and code for OpenMobile
☆50Jul 9, 2026Updated 3 weeks ago
XiXiphus / AcademicDocumentClassifier_without_AllenNLP
View on GitHub
An implementation of Scalable Evaluation and Improvement of Document Set Expansion via Neural Positive-Unlabeled Learning without AllenNL…
☆19Feb 20, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
the-laughing-monkey / agent-rl
View on GitHub
Scripts for training Qwen 2.5 VL with ms-swift and GRPO
☆12Feb 27, 2025Updated last year
amcphail / plot
View on GitHub
Plot package similar to gnuplot
☆23Mar 26, 2024Updated 2 years ago
BetterBench / Academic-paper-classification
View on GitHub
text classification compitioin top 10 strategy
☆18Aug 14, 2021Updated 4 years ago
OS-Copilot / OS-Sentinel
View on GitHub
[ACL 2026] Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic…
☆49Jul 5, 2026Updated 3 weeks ago
aburns4 / textualforesight
View on GitHub
☆12Aug 8, 2024Updated last year
starreeze / efuf
View on GitHub
the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo…
☆21Apr 9, 2025Updated last year
weakrules / Denoise-multi-weak-sources
View on GitHub
☆19Jun 4, 2020Updated 6 years ago
feizc / PNAIC
View on GitHub
Partially Non-Autoregressive Image Captioning
☆10Sep 30, 2021Updated 4 years ago
CYBruce / MAXP-DGL-solutions
View on GitHub
☆21Feb 18, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ToyotaResearchInstitute / tristan
View on GitHub
TRISTAN: TRI's Situation and Trajectory Anticipation Networks
☆14Jun 8, 2026Updated last month
wlzhang2020 / LLMTreeRec
View on GitHub
The implement of LLMTreeRec
☆14Dec 9, 2024Updated last year
DynaMath / DynaMath
View on GitHub
A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models
☆30Nov 25, 2024Updated last year
PKU-ICST-MIPL / LFR-GAN_TOMM2023
View on GitHub
Official repository for "LFR-GAN: Local Feature Refinement based Generative Adversarial Network for Text-to-Image Generation" (TOMM 2023)…
☆10Mar 21, 2023Updated 3 years ago
hoangtuanvu / conformer_ocr
View on GitHub
Transformer OCR is a Optical Character Recognition tookit built for researchers working on both OCR for both Vietnamese and English. This…
☆10Dec 27, 2021Updated 4 years ago
Sion1 / DFAN
View on GitHub
☆12Nov 2, 2024Updated last year
vipulgupta1011 / CALM
View on GitHub
☆11Oct 2, 2023Updated 2 years ago
nju-lug / LUG-Joke-Collection
View on GitHub
收集LUG@NJU群的精华消息，好玩就行。
☆12Jun 22, 2022Updated 4 years ago
mightyzau / InfMLLM
View on GitHub
☆19Dec 6, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
suny-sht / clip-red-circle
View on GitHub
Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023
☆12Sep 21, 2023Updated 2 years ago
chengtan9907 / mc-cot
View on GitHub
The official implementation of the ECCV'24 paper MC-CoT: Boosting the Power of Small Multimodal Reasoning Models to Match Larger Models w…
☆26May 19, 2024Updated 2 years ago
AutoGeo-Official / AutoGeo
View on GitHub
Code for AutoGeo.
☆17Aug 18, 2024Updated last year
Mihonarium / hass-spatial-lights-card
View on GitHub
Better picture-elements card. Spatially arrange and control multiple Home Assistant light entities from a single, highly visual Lovelace…
☆23Updated this week
MuyeHuang / EvoChart
View on GitHub
☆19Nov 3, 2025Updated 8 months ago
chuyg1005 / seeclick-crawler
View on GitHub
☆20Apr 24, 2024Updated 2 years ago
langgege-cqu / maxp_dgl
View on GitHub
2021MXAP-DGL rank2
☆35Mar 23, 2022Updated 4 years ago
Hai-chao-Zhang / OOSTraj
View on GitHub
[CVPR24] OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising
☆16Apr 4, 2024Updated 2 years ago
MasterVito / SwS
View on GitHub
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42Nov 11, 2025Updated 8 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
HAWLYQ / InfoMetIC
View on GitHub
☆13Sep 5, 2023Updated 2 years ago
MrPaoBrother / blockchain
View on GitHub
☆10Jul 24, 2018Updated 8 years ago
pppa2019 / swie_overmiss_llm4mt
View on GitHub
Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"
☆12Aug 26, 2023Updated 2 years ago
llm4sr / PO4ISR
View on GitHub
☆15Jun 4, 2024Updated 2 years ago
sunnweiwei / AmbigPrompt
View on GitHub
Answering Ambiguous Questions via Iterative Prompting
☆14May 25, 2024Updated 2 years ago
Alicebupt / CamRest676_chinese
View on GitHub
CamRest676 is an English data set, I translate it into Chinese for training nlu.
☆12Dec 20, 2017Updated 8 years ago
aaronserianni / attention-iou
View on GitHub
[CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps
☆13Mar 26, 2025Updated last year