Benchmarking Multi-Step Spatial Reasoning in MLLMs with LEGO-based VQA & generation tasks.
☆36Jun 20, 2025Updated 9 months ago
Alternatives and similar repositories for LEGO-Puzzles
Users that are interested in LEGO-Puzzles are comparing it to the libraries listed below
Sorting:
- Official implementation of CharacterShot: Controllable and Consistent 4D Character Animation☆49Feb 27, 2026Updated 3 weeks ago
- code for AAAI accepted paper Similarity Distribution based Membership Inference Attack on Person Re-Identification.☆11Sep 29, 2024Updated last year
- ☆37Sep 26, 2024Updated last year
- Assessing Context-Aware Creative Intelligence in MLLMs☆23Jul 22, 2025Updated 8 months ago
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆25Feb 27, 2026Updated 3 weeks ago
- ☆10Mar 18, 2025Updated last year
- [ECCV 2024] AnyControl, a multi-control image synthesis model that supports any combination of user provided control signals. 一个支持用户自由输入控…☆130Jul 5, 2024Updated last year
- ☆11Jun 28, 2024Updated last year
- CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics☆28Nov 1, 2025Updated 4 months ago
- LMM for VQA, tcsvt version☆10Jul 19, 2024Updated last year
- [ICLR 2025] This is the official implementation for the paper: "Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluat…☆44Jun 11, 2025Updated 9 months ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 6 months ago
- Collections of papers and code for employing MLLM for quality assessment tasks.☆13Apr 18, 2024Updated last year
- Code for MInD: Multimodal Information Disentanglement☆18Dec 17, 2025Updated 3 months ago
- [TIP 2025] Advancing Zero-Shot Digital Human Quality Assessment through Text-Prompted Evaluation☆10Jul 8, 2023Updated 2 years ago
- ☆11Sep 1, 2024Updated last year
- [TCSVT'24] Offical Implementation of 2AFC-LMMs☆12Aug 17, 2024Updated last year
- Official repo for "GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment"☆14Mar 10, 2024Updated 2 years ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- This is a [forked version] for author's debugging. Please jump to https://github.com/QualityAssessment/DOVER for stable version to use.☆14Oct 29, 2023Updated 2 years ago
- JPEG-LM: LLMs as Image Generators with Canonical Codec Representations☆14Sep 29, 2024Updated last year
- 💯收作业系统 | 作业提交系统——这是一个基于Python Flask框架编写的Web应用,用于收集班级作业。☆22Jul 6, 2022Updated 3 years ago
- [CVPRW2024, Official Code] for paper "Exploring AIGC Video Quality: A Focus on Visual Harmony, Video-Text Consistency and Domain Distribu…☆14Jun 14, 2024Updated last year
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 10 months ago
- ☆11Jun 2, 2022Updated 3 years ago
- Implementation of QoMEX 2021 "Image Super-Resolution Quality Assessment: Structural Fidelity Versus Statistical Naturalness"☆17Sep 28, 2022Updated 3 years ago
- ☆11Nov 29, 2024Updated last year
- Official code for "Defense Against Adversarial Attacks on No-Reference Image Quality Models with Gradient Norm Regularization"☆17Aug 7, 2024Updated last year
- LocalHost of PIA in Windows☆14Dec 25, 2023Updated 2 years ago
- [WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese),包含中文版【底层视觉问答】和【底层视觉描述】数据集,以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the futu…☆24Jan 7, 2024Updated 2 years ago
- Benchmarks for the VNN Comp 2023☆16Jun 7, 2024Updated last year
- Code of StyleCrafter on SDXL☆20Jun 25, 2024Updated last year
- A ComfyUI extension for StyleShot.☆16Apr 23, 2025Updated 10 months ago
- ☆18Dec 25, 2023Updated 2 years ago
- Official repo for `LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM', ACM MM2024 Oral☆17Nov 21, 2024Updated last year
- for all, home☆16Mar 6, 2026Updated 2 weeks ago
- (ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation☆12May 21, 2025Updated 10 months ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆19Oct 19, 2025Updated 5 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆24Sep 21, 2025Updated 6 months ago