antgroup / SparkUI-Parser
☆22, updated 2 months ago
Alternatives and similar repositories for SparkUI-Parser
Users who are interested in SparkUI-Parser are comparing it to the repositories listed below.
- [CVPR 2025] InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption ☆46, updated 4 months ago
- Unified Multi-modal IAA Baseline and Benchmark ☆90, updated last year
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L… ☆56, updated last year
- [ECCV 2024] Towards Reliable Advertising Image Generation Using Human Feedback ☆58, updated last year
- [ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model ☆36, updated 11 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment. ☆84, updated 6 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆86, updated last year
- This repository open-sources CreatiPoster, an AI-driven graphic design generation system for multi-layer and editable compositions with s… ☆70, updated 4 months ago
- Benchmark dataset and code of MSRVTT-Personalization ☆49, updated 4 months ago
- ☆26, updated 6 months ago
- Codebase for "Jodi: Unification of Visual Generation and Understanding via Joint Modeling" ☆84, updated 4 months ago
- SFT+RL boosts multimodal reasoning ☆37, updated 4 months ago
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR 2025 (Oral) ☆48, updated 6 months ago
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT ☆135, updated last year
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI 2025 ☆13, updated last year
- Code of our paper "A Unified Agentic Framework for Evaluating Conditional Image Generation". ☆28, updated 3 months ago
- ICML 2025 - Impossible Videos ☆78, updated 3 months ago
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation ☆69, updated last year
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation ☆109, updated 6 months ago
- [CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? ☆100, updated 3 months ago
- [NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation ☆16, updated 8 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models. ☆50, updated last year
- This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA benchmark perform… ☆71, updated last month
- [CVPR 2025] T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation ☆98, updated 2 weeks ago
- ☆66, updated last year
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation ☆33, updated 4 months ago
- [TBench 2024] Official implementation of "AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI" ☆45, updated last year
- ☆36, updated last year
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models ☆84, updated last month
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing ☆69, updated 3 months ago