☆70Oct 9, 2024Updated last year
Alternatives and similar repositories for ViPer
Users that are interested in ViPer are comparing it to the libraries listed below
Sorting:
- Music production for silent film clips.☆32Apr 30, 2025Updated 10 months ago
- ☆15Mar 30, 2025Updated 11 months ago
- ☆16Jun 14, 2024Updated last year
- [WACV 2025] Uniform Attention Maps: Enhancing Image Fidelity in Reconstruction and Editing☆17Mar 16, 2025Updated 11 months ago
- ☆66Jun 4, 2024Updated last year
- Official repo for DiffArtist (ACM MM 2025)☆124Jul 5, 2025Updated 7 months ago
- ☆86Aug 21, 2024Updated last year
- ☆294Aug 30, 2024Updated last year
- [ICML 2025] EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM☆72Jul 16, 2025Updated 7 months ago
- Implementation of layer diffuse inference using refiners☆25Apr 25, 2024Updated last year
- [CVPR 2025] GPS as a Control Signal for Image Generation☆25Mar 18, 2025Updated 11 months ago
- [NeurIPS'2024] Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps☆101Jul 4, 2024Updated last year
- Run text-to-video synthesis in webui.☆25Mar 20, 2023Updated 2 years ago
- ☆26Jun 5, 2024Updated last year
- Official implementation of "DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization" (…☆174Feb 27, 2024Updated 2 years ago
- An official implementation of SwapAnyone.☆74Mar 14, 2025Updated 11 months ago
- [NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching☆168Nov 18, 2024Updated last year
- Rare-to-Frequent (R2F), ICLR'25, Spotlight☆53Apr 23, 2025Updated 10 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆103Jul 5, 2024Updated last year
- Motion Module fine tuner for AnimateDiff.☆78Oct 30, 2023Updated 2 years ago
- This is the official repository for "LatentMan: Generating Consistent Animated Characters using Image Diffusion Models" [CVPRW 2024]☆22Jul 21, 2024Updated last year
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- Implementation for "Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffu…☆13Sep 8, 2023Updated 2 years ago
- ☆14Mar 23, 2023Updated 2 years ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 7 months ago
- Official Implementation of PairCustomization SIGGRAPH Asia 2024☆105Jul 20, 2025Updated 7 months ago
- A diffusers pipeline for zero shot stylised couples portrait creation☆100Dec 10, 2024Updated last year
- Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Func…☆29Dec 2, 2024Updated last year
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆110Nov 24, 2025Updated 3 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆153Jun 25, 2024Updated last year
- CLIP and PASTE: Using AI to Create Photo Collages from Text Prompts☆29Jun 11, 2022Updated 3 years ago
- ☆33Aug 9, 2024Updated last year
- NOVA-3D: Non-overlapped Views for 3D Anime Character Reconstruction☆26Mar 14, 2024Updated last year
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆82Jun 11, 2024Updated last year
- ☆78May 23, 2025Updated 9 months ago
- ☆13Jul 10, 2024Updated last year
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆38Oct 17, 2025Updated 4 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Apr 18, 2024Updated last year
- ☆13Sep 16, 2022Updated 3 years ago