damian0815 / finetune-clip-huggingface
Finetuning CLIP on a small image/text dataset using huggingface libs
β43Updated 2 years ago
Alternatives and similar repositories for finetune-clip-huggingface:
Users that are interested in finetune-clip-huggingface are comparing it to the libraries listed below
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)β83Updated 5 months ago
- CLIP-based aesthetics predictor inspired by the interface of π€ huggingface transformers.β34Updated 7 months ago
- Code for Learning Subject-Aware Cropping by Outpainting Professional Photosβ14Updated last year
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptβ¦β75Updated last year
- Official Implementations "StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing" (CVMJ2024)β63Updated 5 months ago
- Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)β80Updated last month
- β97Updated 8 months ago
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"β83Updated last year
- Code for the paper "Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment"β87Updated last year
- Official Implementations "Get What You Want, Not What You Don't: Image Content Suppression for Text-to-Image Diffusion Models" (ICLR2024)β43Updated last month
- [AAAI 2023] Painterly image harmonization in both spatial domain and frequency domain.β52Updated 7 months ago
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024β¦β32Updated 2 months ago
- [AAAI 2023] CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Queryingβ83Updated last year
- β117Updated 6 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generationβ66Updated 6 months ago
- Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing (NeurIPS 2023)β93Updated 8 months ago
- Training code for CLIP-FlanT5β21Updated 5 months ago
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilitiesβ¦β118Updated 8 months ago
- ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023β122Updated last year
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"β55Updated last year
- β90Updated last year
- Text Image Inpainting via Global Structure-Guided Diffusion Models (Accepted by AAAI-24)β57Updated 7 months ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)β105Updated 11 months ago
- HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editingβ79Updated 9 months ago
- ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"β35Updated last month
- [CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Modelsβ65Updated 9 months ago
- Official PyTorch implementation of "Scaling Up Personalized Image Aesthetic Assessment via Task Vector Customization" (ECCV 2024)β19Updated 2 months ago
- The benchmark of SOTA text-to-image diffusion models with a new benchmarking strategy based on MiniGPT-4, namely X-IQE.β115Updated last year
- β62Updated last year
- An Pytorch implementation of the paper Key-Locked Rank One Editing for Text-to-Image Personalizationβ81Updated last year