Max-We / sf-zero-signal-to-noise
Trying to implement https://arxiv.org/abs/2305.08891
☆32Updated last year
Alternatives and similar repositories for sf-zero-signal-to-noise:
Users that are interested in sf-zero-signal-to-noise are comparing it to the libraries listed below
- Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer"☆46Updated last month
- ☆53Updated last year
- Official pytorch implementation for SingleInsert☆26Updated last year
- Directed Diffusion: Direct Control of Object Placement through Attention Guidance (AAAI2024)☆78Updated last year
- Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆48Updated last week
- ☆25Updated last month
- [ICLR 2024] Code for FreeNoise based on LaVie☆35Updated last year
- [ NeurIPS 2024 D&B Track ] Implementation for "FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models"☆68Updated 4 months ago
- AAAI 2025: Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation☆38Updated 11 months ago
- ☆22Updated 6 months ago
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation☆38Updated last year
- [ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models☆87Updated 8 months ago
- code for "TVG: A Training-free Transition Video Generation Method with Diffusion Models"☆41Updated 8 months ago
- Implementation code of the paper MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing☆57Updated 2 months ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆17Updated 6 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆100Updated last year
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆24Updated 7 months ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆15Updated 8 months ago
- ☆25Updated last month
- Website source files for Diffusion2GAN Project.☆78Updated 7 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆53Updated 6 months ago
- The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling"☆27Updated 2 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated last year
- ☆21Updated last year
- [ECCV 2024] HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance☆46Updated 6 months ago
- A retrain of AnimateDiff to be conditional on an init image☆34Updated last year
- MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance☆25Updated 4 months ago
- ☆61Updated 10 months ago
- Subjects200K dataset☆110Updated 3 months ago
- RepText: Rendering Visual Text via Replicating 🔥☆58Updated last week