[ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP
☆16Apr 17, 2025Updated 11 months ago
Alternatives and similar repositories for RankingAwareCLIP
Users that are interested in RankingAwareCLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV'24] Self-training Room Layout Estimation via Geometry-aware Ray-casting☆15Jan 20, 2025Updated last year
- [ECCV2024] Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance☆23Jul 14, 2024Updated last year
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆22Jun 5, 2025Updated 9 months ago
- ☆14May 20, 2025Updated 10 months ago
- ☆10Aug 9, 2023Updated 2 years ago
- Codes and generated datasets for Paper "Multi-task deep learning for large-scale building detail extraction from high-resolution satellit…☆12Feb 19, 2024Updated 2 years ago
- SGLang Kernel Wheel Index☆17Updated this week
- An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace☆18Oct 21, 2024Updated last year
- Official toolkit for Multi-View Layout Estimation Challenge in OmniCV workshop at CVPR'23.☆16Jun 1, 2023Updated 2 years ago
- [ECCV2022] 3D-PL: Domain Adaptive Depth Estimation with 3D-aware Pseudo-Labeling☆17Sep 20, 2022Updated 3 years ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆23Mar 29, 2025Updated 11 months ago
- [NeurIPS'22] 360-MLC: Multi-view Layout Consistency for Self-training and Hyper-parameter Tuning☆14Apr 3, 2025Updated 11 months ago
- ☆14Mar 6, 2026Updated 2 weeks ago
- End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021☆18Oct 24, 2021Updated 4 years ago
- ☆17Sep 23, 2025Updated 6 months ago
- Scene Parsing with Global Context Embedding, ICCV 2017☆22Feb 28, 2018Updated 8 years ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026)☆47Mar 3, 2026Updated 3 weeks ago
- This repository contains code for the MicroAdam paper.☆21Dec 14, 2024Updated last year
- Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer your physics question!☆16Aug 24, 2024Updated last year
- Open Source Road Datasets☆18Aug 30, 2024Updated last year
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Feb 22, 2026Updated last month
- Code repo for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.☆20Sep 26, 2021Updated 4 years ago
- Fast and memory-efficient exact attention☆29Dec 2, 2024Updated last year
- KSimply: An AI Potential Analyzer that recommends open-source models based on user hardware. / Un analizzatore di potenziale AI che consi…☆15Updated this week
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆27Oct 28, 2024Updated last year
- [ACM MM 2025] MLLMs for Aesthetics Reasoning☆24Jan 5, 2026Updated 2 months ago
- ☆15Feb 13, 2025Updated last year
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆23Jan 27, 2026Updated last month
- [ICCV2025]LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models☆84Sep 8, 2025Updated 6 months ago
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework☆49Jan 25, 2026Updated last month
- Unseen Object Segmentation in Videos via Transferable Representations, ACCV 2018 (oral)☆25Apr 21, 2021Updated 4 years ago
- A fast, lightweight, and extensible RWKV chat UI powered by Flutter. Offline-ready, multi-backend support, ideal for local RWKV inference…☆83Updated this week
- [ICML 2022] Region-Based Semantic Factorization in GANs☆71Dec 24, 2022Updated 3 years ago
- This repository contains code for explaining prototypes learned by ProtoPNet, by quantifying the influence of color hue, shape, texture, …☆17Mar 31, 2021Updated 4 years ago
- Official implementation of "Weakly-supervised positional contrastive learning: application to cirrhosis classification", MICCAI 2023☆11Dec 16, 2025Updated 3 months ago
- [CVPR 2024] Action-slot: Visual Action-centric Representations for Atomic Activity Recognition in Traffic Scenes☆24Apr 28, 2025Updated 10 months ago
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- Domain adaptation framework for segmentation via reinforcement learning.☆13Oct 13, 2025Updated 5 months ago