[CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".
☆32 · May 12, 2025 · Updated 11 months ago
Alternatives and similar repositories for FG-CLIP
Users who are interested in FG-CLIP are comparing it to the libraries listed below.
- [ICML 2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition ☆17 · Jul 9, 2024 · Updated last year
- [ICCV 2023] Composed Image Retrieval on Common Objects in context (CIRCO) dataset ☆86 · Aug 6, 2025 · Updated 8 months ago
- [ICCV 2025] Official repository of the paper "Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabular… ☆183 · Nov 10, 2025 · Updated 5 months ago
- Official Code for "A Likelihood Ratio-Based Approach to Segmenting Unknown Objects" [IJCV 2025] ☆15 · Jun 9, 2025 · Updated 10 months ago
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression ☆42 · Aug 7, 2025 · Updated 8 months ago
- This repository is the official implementation of our paper Robust Diffusion Model-Generated Image Detection with CLIP, accepted by MIPR … ☆10 · Jun 13, 2024 · Updated last year
- LiSu: A Dataset and Method for LiDAR Surface Normal Estimation ☆19 · Nov 30, 2025 · Updated 4 months ago
- Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories". ☆12 · Oct 25, 2022 · Updated 3 years ago
- [ECCVW/TWYN 2024 - Best Workshop Paper] Are CLIP features all you need for Universal Synthetic Image Origin Attribution? ☆12 · Mar 27, 2026 · Updated 2 weeks ago
- Code for FineLIP ☆40 · Nov 25, 2025 · Updated 4 months ago
- This repository contains the code for our CVPR 2024 paper, ☆15 · Aug 27, 2024 · Updated last year
- An Evaluation Framework for Temporal Information Extraction Systems ☆20 · Feb 19, 2026 · Updated last month
- ☆25 · Jan 19, 2026 · Updated 2 months ago
- A vision-language model with an improved cross-attention mechanism for scalable streaming inference ☆29 · Mar 9, 2026 · Updated last month
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts ☆17 · Apr 2, 2025 · Updated last year
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model ☆15 · Jul 31, 2025 · Updated 8 months ago
- Official repository for ODQA experiments from Decomposed Prompting: A Modular Approach for Solving Complex Tasks, ICLR23 ☆12 · Jul 28, 2023 · Updated 2 years ago
- Learning to Count without Annotations ☆23 · May 24, 2024 · Updated last year
- [ICLR 2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction ☆202 · Feb 5, 2024 · Updated 2 years ago
- Official implementation of the paper “Endowing Vision-Language Models with System 2 Thinking for Fine-Grained Visual Recognition,” AAAI 2… ☆37 · Jan 30, 2026 · Updated 2 months ago
- ☆17 · Oct 22, 2024 · Updated last year
- ☆13 · Apr 9, 2024 · Updated 2 years ago
- [AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning" ☆70 · Dec 8, 2025 · Updated 4 months ago
- ☆16 · Sep 6, 2024 · Updated last year
- [MICCAI 2023] (early accept) UOD: universal oneshot detection of anatomical landmarks. https://arxiv.org/abs/2306.07615 ☆12 · Jan 4, 2024 · Updated 2 years ago
- Composed Video Retrieval ☆62 · May 2, 2024 · Updated last year
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation" ☆22 · Jun 5, 2025 · Updated 10 months ago
- Pothole Detection using Ultralytics YOLOv8. ☆34 · Sep 30, 2024 · Updated last year
- Examples of Verbalized Machine Learning (VML) ☆16 · Mar 16, 2025 · Updated last year
- [TCSVT] State-of-the-art open vocabulary detector on COCO/LVIS/V3Det ☆33 · Jun 3, 2025 · Updated 10 months ago
- [ICLR 2025] Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion ☆63 · Nov 30, 2025 · Updated 4 months ago
- [ICCV 2023] Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events ☆10 · Dec 7, 2024 · Updated last year
- Multiresolution Learning-based Hybrid Transformer-CNN Model for Anatomical Landmark Detection ☆12 · Nov 5, 2023 · Updated 2 years ago
- [ECCV 2024] The official implementation of the DiffPNG paper in PyTorch. ☆17 · Oct 17, 2024 · Updated last year
- Repo of NeurIPS23 ☆18 · Oct 25, 2023 · Updated 2 years ago
- [ECCV 2024] FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance ☆17 · Sep 11, 2024 · Updated last year
- ☆18 · Jun 14, 2024 · Updated last year
- Official PyTorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024) ☆145 · Jan 5, 2026 · Updated 3 months ago
- ☆20 · Nov 4, 2023 · Updated 2 years ago