damian0815 / finetune-clip-huggingface
Finetuning CLIP on a small image/text dataset using huggingface libs
☆41 · Updated last year
Related projects
Alternatives and complementary repositories for finetune-clip-huggingface
- OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Percept… ☆73 · Updated last year
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024) ☆81 · Updated 3 months ago
- Code for Shifted Diffusion for Text-to-image Generation (CVPR 2023) ☆160 · Updated last year
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation ☆62 · Updated 4 months ago
- [AAAI 2023] Painterly image harmonization in both spatial domain and frequency domain. ☆50 · Updated 5 months ago
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024… ☆29 · Updated this week
- Official Implementations "StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing" (CVMJ2024) ☆59 · Updated 3 months ago
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities… ☆117 · Updated 6 months ago
- CLIP-based aesthetics predictor inspired by the interface of 🤗 huggingface transformers. ☆29 · Updated 5 months ago
- ☆83 · Updated 10 months ago
- [AAAI 2023] CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying ☆83 · Updated 11 months ago
- Code Implementation of "Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model" ☆107 · Updated 11 months ago
- [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion ☆253 · Updated last week
- The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.