damian0815 / finetune-clip-huggingface
Finetuning CLIP on a small image/text dataset using huggingface libs
☆40Updated last year
Related projects ⓘ
Alternatives and complementary repositories for finetune-clip-huggingface
- Official Implementations "StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing" (CVMJ2024)☆58Updated 3 months ago
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆80Updated 3 months ago
- CLIP-based aesthetics predictor inspired by the interface of 🤗 huggingface transformers.☆29Updated 4 months ago
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆27Updated 3 months ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆105Updated 9 months ago
- [ICCV 2023] Zero-shot image editing with stochastic diffusion models☆42Updated last year
- Image Editing Anything☆112Updated last year
- Implementation of InstructEdit☆67Updated last year
- This repository provides utilities to a minimal dataset for InstructPix2Pix like training for Diffusion models.☆43Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept…☆73Updated last year
- A Diffusion training toolbox based on diffusers and existing SOTA methods, including Dreambooth, Texual Inversion, LoRA, Custom Diffusion…☆77Updated last month
- [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion☆249Updated 3 months ago
- FuseCap: Large Language Model for Visual Data Fusion in Enriched Caption Generation☆48Updated 6 months ago
- MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance (ACM MM2024)☆95Updated last week
- Code Implementation of "Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model"☆107Updated 10 months ago
- [ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions☆106Updated 2 weeks ago
- [CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis☆147Updated last year
- ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆33Updated 4 months ago
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)☆41Updated last month
- ☆119Updated last month
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆111Updated last month
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆132Updated 6 months ago
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…☆117Updated 6 months ago
- Official implementation of the paper "Z∗: Zero-shot Style Transfer via Attention Rearrangement" a.k.a. "Z∗: Zero-shot Style Transfer via …☆52Updated last month
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing☆74Updated 6 months ago
- [ICCV 2023] Phasic Content Fusing Diffusion Model with Directional Distribution Consistency for Few-Shot Model Adaption☆54Updated 11 months ago
- An interactive demo based on Segment-Anything for style transfer which enables different content regions apply different styles.☆96Updated last year
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆100Updated 4 months ago
- ☆91Updated 5 months ago
- [AAAI 2023] CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying☆83Updated 11 months ago