[IJCV 2026] HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts
☆26Feb 28, 2025Updated last year
Alternatives and similar repositories for HiPrompt
Users that are interested in HiPrompt are comparing it to the libraries listed below
Sorting:
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- The official implementation of CCDM and iCCDM.☆23Feb 7, 2026Updated 3 weeks ago
- [ECCV2024] Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation☆67Mar 24, 2025Updated 11 months ago
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation☆148Oct 9, 2025Updated 4 months ago
- ☆11Nov 30, 2025Updated 3 months ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- [ICCV 2025] Diffusion Curriculum (DisCL)☆18Sep 26, 2025Updated 5 months ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- ☆13Jul 10, 2024Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 4 months ago
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 6 months ago
- ☆16May 13, 2025Updated 9 months ago
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation☆16Aug 3, 2025Updated 6 months ago
- ☆16Feb 21, 2025Updated last year
- ☆17Aug 8, 2024Updated last year
- [ICCV2025] Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆61Jun 27, 2025Updated 8 months ago
- ☆16Jun 14, 2024Updated last year
- InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation 🔥☆150Jul 17, 2024Updated last year
- official implementation of "CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusi…☆18Sep 5, 2024Updated last year
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆19Feb 14, 2025Updated last year
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆111Sep 19, 2025Updated 5 months ago
- [ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models".☆91Feb 6, 2026Updated 3 weeks ago
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging☆47Apr 27, 2025Updated 10 months ago
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"☆48Aug 19, 2024Updated last year
- Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"☆314Dec 23, 2024Updated last year
- ☆17Feb 20, 2025Updated last year
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆22Nov 20, 2024Updated last year
- [AAAI 2025 Oral] GURecon: Learning Detailed 3D Geometric Uncertainties for Neural Surface Reconstruction☆23Jul 21, 2025Updated 7 months ago
- SmartCLIP: A training method to improve CLIP with both short and long texts☆37Jun 18, 2025Updated 8 months ago
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated 11 months ago
- An innovative method designed to augment the capabilities of existing video diffusion models☆22May 10, 2024Updated last year
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)☆52Jan 14, 2026Updated last month
- VideoAuteur: Towards Long Narrative Video Generation☆43Oct 22, 2025Updated 4 months ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆97Mar 12, 2025Updated 11 months ago
- ☆53Dec 20, 2024Updated last year
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026)☆42Feb 18, 2026Updated last week
- T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation☆36Sep 16, 2025Updated 5 months ago
- [NeurIPS'25] ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and R…☆31Sep 27, 2025Updated 5 months ago