uynaes / RankingAwareCLIPLinks
[ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP
☆16Updated 8 months ago
Alternatives and similar repositories for RankingAwareCLIP
Users that are interested in RankingAwareCLIP are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] Diffusion Curriculum (DisCL)☆15Updated 3 months ago
- ☆13Updated 11 months ago
- ☆13Updated last year
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Updated last month
- Official Code for: "DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency"☆30Updated this week
- ☆12Updated last year
- Official Repository of Personalized Visual Instruct Tuning☆33Updated 9 months ago
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Updated last month
- Video Diffusion State Space Models☆19Updated last year
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆51Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Updated last year
- ☆25Updated last year
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆51Updated 5 months ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆22Updated 5 months ago
- ☆56Updated 8 months ago
- ☆11Updated last year
- ☆23Updated 7 months ago
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆25Updated last year
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆65Updated last year
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆35Updated 7 months ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆46Updated 5 months ago
- ☆18Updated 6 months ago
- The official repo of continuous speculative decoding☆31Updated 9 months ago
- ☆39Updated 7 months ago
- ☆20Updated 9 months ago
- [CVPR 2024 Highlight] ImageNet-D☆46Updated last year
- A Comprehensive Dataset for Advanced Image Generation and Editing}☆30Updated 2 months ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆40Updated 6 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆21Updated 9 months ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆46Updated last year