[CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model
β55May 31, 2025Updated 9 months ago
Alternatives and similar repositories for FreSca
Users that are interested in FreSca are comparing it to the libraries listed below
Sorting:
- β24Nov 1, 2024Updated last year
- [π IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound β¦β28Nov 1, 2025Updated 4 months ago
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesisβ35Feb 15, 2024Updated 2 years ago
- [CVPR 2025] VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?β29May 10, 2025Updated 10 months ago
- Use βDICE-Talkβ in ComfyUIοΌwhich is a method about 'Correlation-Aware Emotional Talking Portrait Generation'.β25May 7, 2025Updated 10 months ago
- DMM: Building a Versatile Image Generation Model via Distillation-Based Model Mergingβ47Apr 27, 2025Updated 10 months ago
- β20May 11, 2025Updated 10 months ago
- ComfyUI port of SDWebUI Vectorscope CC and Diffusion CG extensionsβ20Feb 24, 2025Updated last year
- Nodes for image juxtaposition for Flux in ComfyUIβ12Apr 22, 2025Updated 11 months ago
- β34May 7, 2025Updated 10 months ago
- β11Dec 8, 2025Updated 3 months ago
- Try OmniParser inComfyUI which a simple screen parsing tool towards pure vision based GUI agent.β39Mar 12, 2025Updated last year
- [Official Implementation] Improving Editability in Image Generation with Layer-wise Memory, CVPR 2025β37Mar 2, 2026Updated 3 weeks ago
- AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generationβ16Aug 3, 2025Updated 7 months ago
- Wrapper of DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion, run in diffusers modeβ29Feb 26, 2026Updated 3 weeks ago
- Comfy UI nodes for Flex.1β80Aug 5, 2025Updated 7 months ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.β15Feb 1, 2025Updated last year
- You can use SHMT method to apply makeup to the characters when use ComfyUIβ30Jan 9, 2025Updated last year
- Implementing FlowEdit, maybe other inversion techniques for the Wan video generation modelβ54Feb 28, 2025Updated last year
- Custom node to translate prompts into Chineseβ11Sep 15, 2024Updated last year
- (CVPR 2025) Scailing Down Text Encoders of Text-to-Image Diffusion Modelsβ52Sep 10, 2025Updated 6 months ago
- β11May 22, 2024Updated last year
- TPDiff: Temporal Pyramid Video Diffusion Modelβ25Mar 13, 2025Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformerβ16Sep 7, 2024Updated last year
- [ICCV 2025] Edicho: Consistent Image Editing in the Wildβ124Oct 22, 2025Updated 5 months ago
- A set of nodes to prepare the noise predictions before the CFG functionβ63May 24, 2025Updated 9 months ago
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)β20Jan 18, 2026Updated 2 months ago
- [AAAI 2025] Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understandingβ34Mar 21, 2025Updated last year
- Wildly unsound and experimental sampling for ComfyUIβ29Aug 9, 2025Updated 7 months ago
- ComfyUI nodes to use APG scaling for CFGβ29Oct 6, 2024Updated last year
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".β118May 3, 2025Updated 10 months ago
- Official repo for CFG-Zero*β706May 2, 2025Updated 10 months ago
- [AAAI 2026] This is the official implementation of "T-LoRA: Single Image Diffusion Model Customization Without Overfitting"β133Feb 16, 2026Updated last month
- ICML2025β65Aug 28, 2025Updated 6 months ago
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmupβ36Oct 15, 2025Updated 5 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"β10Jul 19, 2024Updated last year
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)β86Feb 27, 2025Updated last year
- Official code of "MakeAnything: Harnessing Diffusion Transformers for Multi-Domain Procedural Sequence Generation"β205Apr 1, 2025Updated 11 months ago
- β20Nov 23, 2025Updated 3 months ago