Benchmarking Multi-Step Spatial Reasoning in MLLMs with LEGO-based VQA & generation tasks.
☆36Jun 20, 2025Updated 9 months ago
Alternatives and similar repositories for LEGO-Puzzles
Users that are interested in LEGO-Puzzles are comparing it to the libraries listed below.
- Official implementation of CharacterShot: Controllable and Consistent 4D Character Animation☆49Feb 27, 2026Updated last month
- Code for the AAAI-accepted paper "Similarity Distribution based Membership Inference Attack on Person Re-Identification".☆11Sep 29, 2024Updated last year
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- ☆17Apr 17, 2025Updated 11 months ago
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Jul 22, 2025Updated 8 months ago
- [ICCV2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events☆10Dec 7, 2024Updated last year
- Official code for paper "OpenCIL: Benchmarking Out-of-Distribution Detection in Class-Incremental Learning"☆11Jun 19, 2024Updated last year
- The Notion Citation Updater is a Python script designed to automate the process of updating citation counts for academic papers stored in…☆15Oct 28, 2024Updated last year
- 🤖 A C++ environment for building TensorFlow Lite projects on Raspberry Pi Zero (armv6)☆10Apr 13, 2021Updated 4 years ago
- StyleShot: A SnapShot on Any Style. A model that transfers any style to any content and generates high-quality, personalized stylized images without per-image fine-tuning!☆458Jun 30, 2025Updated 9 months ago
- Official implementation of "TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization" (Findings of ACL …☆21Jul 25, 2025Updated 8 months ago
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user-provided control signals. A model that supports free-form user-supplied control…☆130Jul 5, 2024Updated last year
- ☆11Jun 28, 2024Updated last year
- LMM for VQA, tcsvt version☆10Jul 19, 2024Updated last year
- Backup repo for "MD-VQA: Multi-Dimensional Quality Assessment for UGC Live Videos"☆14Feb 16, 2024Updated 2 years ago
- [ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluat…☆44Jun 11, 2025Updated 10 months ago
- A curated list of Person Re-Identification papers and BibTeX entries☆19Feb 24, 2024Updated 2 years ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 6 months ago
- CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics☆28Nov 1, 2025Updated 5 months ago
- Code for MInD: Multimodal Information Disentanglement☆19Dec 17, 2025Updated 3 months ago
- Ubuntu configuration script with full-featured, one-click installation and desktop theming. Linux Auto Configuration Script for Ubuntu 14.04 to 22.04☆19Feb 16, 2026Updated last month
- ☆11Sep 1, 2024Updated last year
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"☆19Feb 1, 2026Updated 2 months ago
- PRODeep: A Platform for Robustness Verification of Deep Neural Networks☆12Nov 11, 2020Updated 5 years ago
- This is a forked version for the author's debugging. Please go to https://github.com/QualityAssessment/DOVER for the stable version.☆14Oct 29, 2023Updated 2 years ago
- [CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…☆14Jun 14, 2024Updated last year
- 💯 Homework collection system | Homework submission system: a web application written with the Python Flask framework for collecting class assignments.☆22Jul 6, 2022Updated 3 years ago
- "Geo-metric: A Perceptual Dataset of Distortions on Faces" by Wolski et al., SIGGRAPH Asia 2022.☆24Nov 9, 2022Updated 3 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 10 months ago
- ☆11Jun 2, 2022Updated 3 years ago
- DELTA: Decomposed Efficient Long-Term Robot Task Planning using Large Language Models☆23Apr 16, 2025Updated 11 months ago
- [ICLR 2025] SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction☆40Mar 24, 2025Updated last year
- 📜 My Vim & Neovim config☆32Nov 25, 2025Updated 4 months ago
- Official code for "Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization"☆17Aug 7, 2024Updated last year
- LocalHost of PIA in Windows☆14Dec 25, 2023Updated 2 years ago
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆17Aug 3, 2025Updated 8 months ago
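- [WIP@Oct 13] 质衡 (Q-Bench in Chinese), including Chinese versions of the low-level visual question answering and low-level visual description datasets, plus image quality assessment under Chinese prompts. We will release Q-Bench in more languages in the futu…☆24Jan 7, 2024Updated 2 years ago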
- Benchmarks for the VNN Comp 2023☆16Jun 7, 2024Updated last year
- [CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image…☆80Feb 26, 2026Updated last month