[CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".
☆32May 12, 2025Updated 9 months ago
Alternatives and similar repositories for FG-CLIP
Users that are interested in FG-CLIP are comparing it to the libraries listed below
Sorting:
- [CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec…☆66Apr 4, 2025Updated 10 months ago
- [ICML2024] Official PyTorch implementation of CoMC: Language-Driven Cross-Modal Classifier for Zero-Shot Multi-Label Image Recognition☆16Jul 9, 2024Updated last year
- Code for the paper "Manipulating Embeddings of Stable Diffusion Prompts".☆15Aug 8, 2024Updated last year
- ☆22Apr 22, 2025Updated 10 months ago
- [ICCV 2025] Deeply Supervised Flow-Based Generative Models☆28Jun 26, 2025Updated 8 months ago
- [AAAI 2026 Oral] The official code of "UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning"☆64Dec 8, 2025Updated 2 months ago
- A Massive Multi-Discipline Lecture Understanding Benchmark☆32Nov 1, 2025Updated 4 months ago
- code for FineLIP☆38Nov 25, 2025Updated 3 months ago
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆32Jun 3, 2025Updated 9 months ago
- [ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction☆201Feb 5, 2024Updated 2 years ago
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆85Aug 6, 2025Updated 6 months ago
- ☆35Nov 25, 2025Updated 3 months ago
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)☆35May 29, 2024Updated last year
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆41Jun 22, 2024Updated last year
- [ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model☆53Oct 12, 2025Updated 4 months ago
- Reinforcement learning environment for UR5e robot with OPENAI gym like format. Include both simulation and real parts.☆14Nov 2, 2021Updated 4 years ago
- [NeurIPS 2022 Spotlight] RLIP: Relational Language-Image Pre-training and a series of other methods to solve HOI detection and Scene Grap…☆78May 26, 2024Updated last year
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆43Mar 11, 2025Updated 11 months ago
- Heatmap-based Out-of-Distribution Detection (WACV 2023)☆13Mar 27, 2024Updated last year
- A generic tensorflow library for robotics: a bridge between robotics problem and modern machine learning architecture. Provides forward k…☆13Apr 12, 2024Updated last year
- A mouse brain histology tool for neuroscientists.☆13Feb 16, 2026Updated 2 weeks ago
- Modern normalizing flows in Python. Simple to use and easily extensible.☆12Feb 11, 2026Updated 2 weeks ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆14Jul 31, 2025Updated 7 months ago
- Source codes of Learning Causal Representations for Robust Domain Adaptation (IEEE TKDE)☆12Feb 14, 2022Updated 4 years ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆29Feb 4, 2026Updated 3 weeks ago
- [ICML2023] InfoOT: Information Maximizing Optimal Transport☆41Apr 27, 2023Updated 2 years ago
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆97Mar 26, 2025Updated 11 months ago
- Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.☆71Aug 8, 2025Updated 6 months ago
- Visual self-questioning for large vision-language assistant.☆45Jul 23, 2025Updated 7 months ago
- [ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"☆893Aug 13, 2024Updated last year
- Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training (ACL 2023))☆92Jun 12, 2023Updated 2 years ago
- ☆44Jan 14, 2026Updated last month
- Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020☆11Aug 28, 2020Updated 5 years ago
- ☆10Oct 31, 2020Updated 5 years ago
- ☆12May 20, 2025Updated 9 months ago
- A simple script to add pdf-files to Zotero via CLI☆12May 17, 2020Updated 5 years ago
- Python solutions to coding questions in Leetcode☆13Sep 12, 2020Updated 5 years ago
- Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes☆10Jul 10, 2024Updated last year