raywang335 / L2RCLIPLinks
The official implementation of the paper "Learning-to-Rank Meets Language: Boosting Language-Driven Ordering Alignment for Ordinal Classification" is available.
☆13Updated 2 years ago
Alternatives and similar repositories for L2RCLIP
Users that are interested in L2RCLIP are comparing it to the libraries listed below
Sorting:
- [FG 2021🎈] A small-scale face image dataset with large-scale facial attributes for text-to-face generation and manipulation.☆49Updated 2 years ago
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"☆87Updated 2 years ago
- Code for "DreamEdit: Subject-driven Image Editing" (TMLR2023)☆109Updated 2 years ago
- CVPR2023 paper☆52Updated 2 years ago
- [CVPR 2023 (Highlight)] FAME-ViL: Multi-Tasking V+L Model for Heterogeneous Fashion Tasks☆56Updated 2 years ago
- ☆119Updated last year
- This respository contains the code for the CVPR 2023 paper SINE: SINgle Image Editing with Text-to-Image Diffusion Models.☆190Updated 2 years ago
- [ECCV 2022] FashionViL: Fashion-Focused V+L Representation Learning☆61Updated 3 years ago
- A large-scale visual-language face dataset with fine-grained annotations (ICCV 2021)☆70Updated 3 years ago
- Official implementation of Faceptor: A Generalist Model for Face Perception.☆49Updated last year
- Official Pytorch implementation of "CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion" (TMLR 2024)☆88Updated last year
- [ICCV 2023 Oral, Best Paper Finalist] ITI-GEN: Inclusive Text-to-Image Generation☆69Updated last year
- Official code of "StyleT2I: Toward Compositional and High-Fidelity Text-to-Image Synthesis" (CVPR 2022)☆43Updated 3 years ago
- Implementation of InstructEdit☆76Updated 2 years ago
- ☆24Updated 2 years ago
- Implementation of P+: Extended Textual Conditioning in Text-to-Image Generation☆49Updated 2 years ago
- [ICCV 2023] Controllable Person Image Synthesis with Pose‑Constrained Latent Diffusion☆43Updated 2 years ago
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…☆120Updated 5 months ago
- ☆93Updated 2 years ago
- A curated list of text-based image manipulation methods.☆84Updated last year
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties☆131Updated last year
- Implementation UniTune based on stable diffusion☆40Updated 3 years ago
- [NeurIP'22] OrdinalCLIP: Learning Rank Prompts for Language-Guided Ordinal Regression☆53Updated last year
- (CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision☆136Updated last year
- ☆15Updated last year
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆33Updated 10 months ago
- ☆130Updated last year
- Ablating Concepts in Text-to-Image Diffusion Models (ICCV 2023)☆167Updated last year
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆136Updated last year
- Research code for paper "Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis"☆115Updated last year