CSU-JPG/V-MAGE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CSU-JPG/V-MAGE)

CSU-JPG / V-MAGE

[ACL '26 Findings] V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in MLLMs

☆27

Alternatives and similar repositories for V-MAGE

Users that are interested in V-MAGE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CSU-JPG / Chart2Code
View on GitHub
[ACL-main-2026]We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Mode…
☆29Jan 27, 2026Updated 5 months ago
CSU-JPG / Awesome-VLM-Reasoning
View on GitHub
☆21May 19, 2025Updated last year
CSU-JPG / MVPBench
View on GitHub
Seeing is Not Reasoning: MVPBench for Graph-based Evaluation of Multi-path Visual Physical CoT
☆14Jul 30, 2025Updated 11 months ago
CSU-JPG / TextAtlas
View on GitHub
[ICML 2026]A Large-scale Dataset for training and evaluating model's ability on Dense Text Image Generation
☆93Sep 27, 2025Updated 9 months ago
SuyangLumiere / QwenImageEdit_MultiGPU
View on GitHub
A lightweight implementation of the Qwen-Image-Edit model for inference and LoRA fine-tuning on 8×V100 GPUs
☆108Mar 19, 2026Updated 4 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Shopee-MUG / MUG-U
View on GitHub
一个强大的多模态大语言模型（MLLM），支持文本、图像、视频等多模态输入，具备强大的理解、推理和生成能力。
☆23Mar 19, 2025Updated last year
waynelee-lwc / 2119-snl-compiler-design
View on GitHub
吉林大学编译原理课程设计——编译器实现
☆16Feb 9, 2023Updated 3 years ago
BoLeiChen / SemNav-HRO
View on GitHub
☆14Mar 20, 2023Updated 3 years ago
fal-ai / lavender-data
View on GitHub
Load & manage evolving datasets efficiently
☆22Aug 22, 2025Updated 11 months ago
zai-org / UI2Code_N
View on GitHub
☆78May 2, 2026Updated 2 months ago
xlite-dev / qwen-image-fast
View on GitHub
⚡️Qwen-Image 4.8x🎉 speedup with Hybrid Acceleration for low VRAM GPUs
☆17Oct 24, 2025Updated 9 months ago
HuangOwen / RoLoRA
View on GitHub
[EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
☆41Sep 24, 2024Updated last year
ahnjaewoo / FlashAdventure
View on GitHub
🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"
☆27Apr 26, 2026Updated 2 months ago
CSU-JPG / VJA
View on GitHub
[ICML 26 ORAL] When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models
☆27Jun 30, 2026Updated 3 weeks ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Dogacel / Attention-Drift
View on GitHub
Code for the paper *Attention Drift: What Speculative Decoding Models Learn*.
☆27May 12, 2026Updated 2 months ago
apple / ml-gie-bench
View on GitHub
☆21Jul 24, 2025Updated last year
CraftJarvis / OpenHA
View on GitHub
Repo for Paper "OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft"
☆41Jun 5, 2026Updated last month
phyworld / phyworld
View on GitHub
☆175Jan 6, 2025Updated last year
MajorDavidZhang / Generalization_unified_VLM
View on GitHub
☆24May 23, 2025Updated last year
THUSI-Lab / Awesome-LFMs-Play-Games
View on GitHub
A Survey on Large Foundation Models as Game Players - Datasets, Models, Harness and Benchmarks
☆37May 13, 2026Updated 2 months ago
OpenGVLab / VRBench
View on GitHub
[ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos
☆28Jun 4, 2026Updated last month
univ-esuty / ambifusion
View on GitHub
Official repository for the paper ''ambigram generation by a diffusion model''.
☆17Aug 9, 2023Updated 2 years ago
YujieLu10 / Seeker
View on GitHub
☆11May 24, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
opendatalab / CLIP-Parrot-Bias
View on GitHub
ECCV2024_Parrot Captions Teach CLIP to Spot Text
☆66Sep 6, 2024Updated last year
DataArcTech / ChartBench
View on GitHub
☆16May 15, 2025Updated last year
TAU-VAILab / SemRayLoc
View on GitHub
☆22Jul 15, 2025Updated last year
PRIS-CV / Category-Specific-Prompt
View on GitHub
Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"
☆14Feb 21, 2024Updated 2 years ago
zihuixue / MKE
View on GitHub
[ICCV 2021] Multimodal Knowledge Expansion
☆10Aug 28, 2021Updated 4 years ago
HKUST-LongGroup / CoMM
View on GitHub
[CVPR 2025 Highlight] Official repository for CoMM Dataset
☆56Dec 31, 2024Updated last year
ATR-DBI / Map-EQA
View on GitHub
☆12Oct 10, 2024Updated last year
Odysseusq / VLCache
View on GitHub
Official Repo for paper "VLCache: Computing 2% Vision Tokens and Reusing 98% for Vision-Language Inference"
☆16Mar 28, 2026Updated 3 months ago
ulab-uiuc / diagram-eval
View on GitHub
[EMNLP 2025] DiagramEval: Evaluating LLM-Generated Diagrams via Graphs
☆17Nov 1, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
PeterGriffinJin / InstructG2I
View on GitHub
InstructG2I: Synthesizing Images from Multimodal Attributed Graphs (NeurIPs 2024)
☆19Oct 17, 2024Updated last year
VIM-Bench / VIM_TOOL
View on GitHub
☆12Jun 12, 2024Updated 2 years ago
KoDohwan / VT-TWINS
View on GitHub
Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)
☆11Oct 12, 2022Updated 3 years ago
597358816 / AEPO
View on GitHub
Arbitrary Entropy Policy Optimization: Entropy Is Controllable in Reinforcement Fine-tuning
☆17Jan 19, 2026Updated 6 months ago
sauradip / MUPPET
View on GitHub
[ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"
☆16Aug 30, 2023Updated 2 years ago
JustinYuu / MM_Pyramid
View on GitHub
[ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing
☆15Aug 26, 2022Updated 3 years ago
lambert-x / VideoAuteur
View on GitHub
VideoAuteur: Towards Long Narrative Video Generation
☆44Oct 22, 2025Updated 9 months ago