MBZUAI-LLM/web2code

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

☆103

Alternatives and similar repositories for web2code

Users who are interested in web2code are comparing it to the repositories listed below.

David-Li0406 / SMoA
☆15 · Jan 24, 2025 · Updated last year
HumanEval-V / HumanEval-V-Benchmark
A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks
☆15 · Feb 25, 2025 · Updated last year
AlexCuadron / ThinkingAgent
Systematic evaluation framework that automatically rates overthinking behavior in large language models.
☆103 · May 16, 2025 · Updated last year
WebPAI / DCGen
☆44 · Dec 8, 2025 · Updated 7 months ago
Jiacheng8 / CV-DD
Dataset Distillation via Committee Voting
☆15 · Jul 28, 2025 · Updated 11 months ago
jinzhuoran / RAG-RewardBench
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆18 · Dec 19, 2024 · Updated last year
pengshuai-rin / MultiMath
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
☆33 · Jan 22, 2025 · Updated last year
zhuyunqi96 / LoraLPrun
☆13 · May 21, 2023 · Updated 3 years ago
MetaAgentX / OpenCaptchaWorld
[NeurIPS 2025] The first web-based benchmark and platform to evaluate visual reasoning and interaction capabilities of MLLM powered agent…
☆82 · Feb 19, 2026 · Updated 5 months ago
yale-nlp / refdpo
☆16 · Jul 23, 2024 · Updated 2 years ago
lt-asset / Waffle
For ACL25 paper "WAFFLE: Multi-Modal Model for Automated Front-End Development" - by Shanchao Liang and Nan Jiang and Shangshu Qian and L…
☆12 · May 28, 2025 · Updated last year
chtmp223 / suri
Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]
☆27 · Oct 3, 2025 · Updated 9 months ago
LgQu / TIGeR
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16 · Updated this week
GAIR-NLP / MAYE
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme
☆149 · Apr 9, 2025 · Updated last year
Lzq5 / Video-Text-Alignment
☆28 · Jul 18, 2025 · Updated last year
C0nsumption / Consume-Blip3
XGEN-MM(BLIP3) Autocaptioning Tools
☆17 · Jun 20, 2024 · Updated 2 years ago
junhahyung / MagiCapture
☆11 · Feb 26, 2024 · Updated 2 years ago
TencentARC / Plot2Code
☆23 · Aug 17, 2024 · Updated last year
WeihuangLin / INF-LLaVA
INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model
☆42 · Aug 4, 2024 · Updated last year
YuigaWada / Polos
[CVPR24 Highlights] Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
☆33 · Jun 12, 2026 · Updated last month
haoningwu3639 / MRGen
[ICCV 2025] MRGen: Segmentation Data Engine for Underrepresented MRI Modalities
☆41 · Sep 26, 2025 · Updated 9 months ago
RLHF-V / RLAIF-V
[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
☆456 · May 14, 2025 · Updated last year
Yaxin9Luo / Gamma-MOD
[ICLR2025] γ-MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models
☆45 · Oct 28, 2025 · Updated 8 months ago
Keytoyze / JumpCoder
Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via Online Modification"
☆25 · May 18, 2024 · Updated 2 years ago
mu-cai / matryoshka-mm
Matryoshka Multimodal Models
☆123 · Jan 22, 2025 · Updated last year
jylei16 / Imagine-e
☆14 · Jan 22, 2025 · Updated last year
google-research-datasets / uicrit
UICrit is a dataset containing human-generated natural language design critiques, corresponding bounding boxes for each critique, and des…
☆27 · Nov 19, 2024 · Updated last year
ZZR0 / ISSTA22-CodeStudy
This repo illustrates how to evaluate the artifacts in the paper An Extensive Study on Pre-trained Models for Program Understanding and G…
☆27 · Aug 12, 2022 · Updated 3 years ago
thunlp / LLaVA-UHD
LLaVA-UHD v3: Progressive Visual Compression for Efficient Native-Resolution Encoding in MLLMs
☆423 · Jul 6, 2026 · Updated 2 weeks ago
showlab / VisInContext
Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
☆28 · Oct 30, 2024 · Updated last year
bor0 / dafny-tutorial
Exercises for the Dafny Tutorial
☆14 · May 21, 2018 · Updated 8 years ago
MNoorFawi / curlora
The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.
☆53 · Aug 28, 2024 · Updated last year
thunlp / ChartCoder
[ACL'25 Main] ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation
☆79 · Dec 8, 2025 · Updated 7 months ago
royeisen / reasoning_loading_bar
☆56 · Jul 7, 2025 · Updated last year
hewei2001 / ReachQA
[EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs
☆61 · Aug 25, 2025 · Updated 10 months ago
cofe-ai / Sketch
☆18 · Sep 5, 2024 · Updated last year
zwq2018 / Multi-modal-Self-instruct
The codebase for our EMNLP24 paper: Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Mo…
☆85 · Jan 27, 2025 · Updated last year
si0wang / ViCrit
☆24 · Jun 18, 2025 · Updated last year
zzxslp / SoM-LLaVA
[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
☆145 · Aug 23, 2024 · Updated last year