jingjiqinggong / specp2pLinks
SpecRef: A Fast Training-free Baseline of Specific Reference-Condition Real Image Editing
☆35Updated last year
Alternatives and similar repositories for specp2p
Users that are interested in specp2p are comparing it to the libraries listed below
Sorting:
- The repository for the paper "Image Inversion: A Survey from GANs to Diffusion and Beyond".☆74Updated last month
- Using reference images to control style in text-to-image diffusion models. Based on CSD and IP Adapter☆53Updated 3 months ago
- Image and video Tokenizer/VAE selection guide, text and face reconstruction evaluation.☆70Updated 3 weeks ago
- [ECCV 2024] Tuning-Free Image Customization with Image and Text Guidance☆144Updated 4 months ago
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆231Updated last month
- ☆21Updated last year
- Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer☆114Updated last month
- Official code base for paper EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidan…☆104Updated last month
- ICCV 2023: Weakly-supervised 3D Pose Transfer with Keypoints☆58Updated last month
- [CVPR24] Official Implementation of 'A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video …☆98Updated last year
- ☆107Updated this week
- 【 ICLR 2025 】I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength☆109Updated 3 months ago
- Official Implementation for "Mask-based modeling for Neural Radiance Fields" (ICLR 2024)☆37Updated last year
- Official implementation of "Generating images with 3D annotations using diffusion models".☆49Updated 10 months ago
- Official implementation of X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models☆154Updated 6 months ago
- OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation☆117Updated this week
- [ICLR 2025] BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities☆144Updated 5 months ago
- Official Code of Logits-Based-Finetuning☆85Updated last week
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆39Updated 11 months ago
- [AAAI 2025] Code for paper:Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation☆3Updated 5 months ago
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval☆49Updated 4 months ago
- For paper "AgileGAN: Stylizing Portraits by Inversion-Consistent Transfer Learning"☆43Updated 2 years ago
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆97Updated 2 months ago
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"☆111Updated 8 months ago
- Panorama Generation as a Next-Token Prediction Task.☆20Updated 2 months ago
- ☆42Updated 10 months ago
- [CVPR 2024] Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation☆113Updated last year
- diffusion lora chinese tutorial,虚拟idol训练中文教程☆79Updated 4 months ago
- ☆44Updated 2 months ago
- An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …☆21Updated last year