Tangkexian/LEGO-Puzzles


Benchmarking Multi-Step Spatial Reasoning in MLLMs with LEGO-based VQA & generation tasks.

☆37

Alternatives and similar repositories for LEGO-Puzzles

Users who are interested in LEGO-Puzzles are comparing it to the repositories listed below.

Jeoyal / CharacterShot
Official implementation of CharacterShot: Controllable and Consistent 4D Character Animation
☆51 · Apr 14, 2026 · Updated 3 months ago
open-mmlab / FaceShot
Official repo for FaceShot: Bring Any Character into Life
☆83 · Jun 30, 2025 · Updated last year
Tencent / MegaStyle
MegaStyle, a scalable style data generation framework targeting consistency and diversity
☆131 · Updated this week
open-mmlab / StyleShot
StyleShot: A SnapShot on Any Style. A model that can transfer any style to any content and generate high-quality, personalized stylized images without per-image fine-tuning!
☆471 · Jun 30, 2025 · Updated last year
ABTols / ColorSurge
[Siggraph2025] The official code of the paper "ColorSurge: Bringing Vibrancy and Efficiency to Automatic Video Colorization via Dual-Bran…
☆15 · Jul 26, 2025 · Updated 11 months ago
open-compass / Creation-MMBench
Assessing Context-Aware Creative Intelligence in MLLMs
☆23 · Jul 22, 2025 · Updated last year
zengyh1900 / handy_voting
handy tools for user study
☆21 · May 21, 2024 · Updated 2 years ago
mala-lab / OpenCIL
Official code for paper "OpenCIL: Benchmarking Out-of-Distribution Detection in Class-Incremental Learning"
☆13 · Jun 19, 2024 · Updated 2 years ago
drnighthan / PIA_LocalHost_Windows
LocalHost of PIA in Windows
☆13 · Dec 25, 2023 · Updated 2 years ago
open-mmlab / AnyControl
[ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. Supports free user input of control…
☆132 · Jul 5, 2024 · Updated 2 years ago
stogiannidis / srbench
Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"
☆19 · Feb 1, 2026 · Updated 5 months ago
NewComer00 / tflite4zero_env
🤖 A C++ Environment for Building Tensorflow-lite Projects on Raspberry Pi Zero (armv6)
☆10 · Apr 13, 2021 · Updated 5 years ago
UCSB-AI / MSSBench
[ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"
☆36 · Jun 23, 2025 · Updated last year
camenduru / PIA-colab
☆25 · Dec 22, 2023 · Updated 2 years ago
THUKElab / LatEval
☆10 · Mar 19, 2024 · Updated 2 years ago
jiepengwang / MMGen
☆17 · Apr 17, 2025 · Updated last year
Raphoo / linear-mech-vlms
Code for "Linear Mechanisms for Spatiotemporal Reasoning in Vision Language Models"
☆15 · Feb 16, 2026 · Updated 5 months ago
hmwang2002 / CTRL-S
[ECCV 2026] Official repository of "Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning".
☆22 · Updated this week
skywalker023 / thought-tracing
🚲 Code and benchmark for our COLM 2025 paper - "Thought Tracing: Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models"
☆15 · Aug 8, 2025 · Updated 11 months ago
hany01rye / tiger
TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
☆23 · Nov 18, 2025 · Updated 8 months ago
jiahai-feng / binding-iclr
☆19 · Mar 5, 2024 · Updated 2 years ago
prs-eth / StitchVM
Official code for "Stitched Value Model for Diffusion Alignment"
☆28 · May 21, 2026 · Updated 2 months ago
bojone / softtopk
Differentiable top-k operator
☆23 · Dec 30, 2024 · Updated last year
ZJU-REAL / ViewSpatial-Bench
[ECCV 2026] ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models
☆82 · Mar 9, 2026 · Updated 4 months ago
zzc-1998 / MD-VQA
Backup repo for "MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos"
☆14 · Feb 16, 2024 · Updated 2 years ago
zzc-1998 / MLLM-QA-Papers-with-Code
Collections of papers and code for employing MLLM for quality assessment tasks.
☆12 · Apr 18, 2024 · Updated 2 years ago
TIGER-AI-Lab / PixelWorld
The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
☆15 · Sep 12, 2025 · Updated 10 months ago
KAIST-Visual-AI-Group / APC-VLM
[ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
☆66 · Sep 12, 2025 · Updated 10 months ago
zzc-1998 / SJTU-H3D
[TIP 2025] Advancing Zero-Shot Digital Human Quality Assessment through Text-Prompted Evaluation
☆12 · Jul 8, 2023 · Updated 3 years ago
krennic999 / ARsample
Code for paper "Towards Better & Faster Autoregressive Image Generation: From the Perspective of Entropy" [NeurIPS 2025].
☆18 · Dec 6, 2025 · Updated 7 months ago
AI45Lab / IS-Bench
[AAAI 2026] Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
☆47 · Nov 24, 2025 · Updated 7 months ago
mahtabbigverdi / Aurora-perception
☆50 · Feb 18, 2026 · Updated 5 months ago
1010075746 / NBU-CIQAD
☆11 · Jun 2, 2022 · Updated 4 years ago
JackHck / FVP
[ICCV 2025] FVP: 4D Visual Pre-training for Robot Learning
☆17 · Sep 5, 2025 · Updated 10 months ago
JongSuk1 / AVCap
☆11 · Sep 1, 2024 · Updated last year
h4nwei / 2AFC-LMMs
[TCSVT'24] Official Implementation of 2AFC-LMMs
☆12 · Aug 17, 2024 · Updated last year
zzc-1998 / GMS-3DQA
Official repo for "GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment"
☆14 · Mar 10, 2024 · Updated 2 years ago
zmzhang2000 / MMMC
Official repository for Robust Multimodal Large Language Models Against Modality Conflict
☆22 · Jul 9, 2025 · Updated last year
rdi-berkeley / awesome-RLVR-boundary
A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu…
☆89 · Dec 12, 2025 · Updated 7 months ago