code for FineLIP
☆43Nov 25, 2025Updated 7 months ago
Alternatives and similar repositories for FineLIP
Users that are interested in FineLIP are comparing it to the libraries listed below.
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆144Mar 12, 2026Updated 3 months ago
- ☆17Jan 30, 2024Updated 2 years ago
- [ECCV2024] FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆18Sep 11, 2024Updated last year
- ☆11Oct 20, 2023Updated 2 years ago
- This is the official repo of OpenSatMap in NeurIPS 2024 D&B Track☆31Jul 6, 2025Updated 11 months ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆11Aug 26, 2024Updated last year
- ☆13Aug 14, 2022Updated 3 years ago
- ☆38Jul 24, 2023Updated 2 years ago
- [CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".☆31May 12, 2025Updated last year
- ☆22Aug 8, 2024Updated last year
- ☆30Aug 14, 2023Updated 2 years ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆88May 10, 2025Updated last year
- ☆23May 8, 2025Updated last year
- ☆37Mar 28, 2025Updated last year
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)☆36May 29, 2024Updated 2 years ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- VGI-Enhanced multimodal large language model for remote sensing images.☆191Mar 4, 2025Updated last year
- ☆43Jan 12, 2026Updated 5 months ago
- ☆28May 20, 2026Updated last month
- ☆39Jan 20, 2024Updated 2 years ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆24Dec 4, 2024Updated last year
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation☆35Jul 18, 2025Updated 11 months ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆15Oct 22, 2024Updated last year
- CausalDynamics: A large-scale benchmark for structural discovery of dynamical causal models☆34Oct 6, 2025Updated 8 months ago
- This repository contains the implementation of the method described in our paper, "Divide and Conquer: Isolating Normal-Abnormal Attribut…☆11Apr 9, 2024Updated 2 years ago
- [CVPR 2025] This repository is intended to store the code and data for ASAP (Advancing Semantic Alignment Promotes Multi-Modal Manipulati…☆21Jun 18, 2025Updated last year
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆100Mar 26, 2025Updated last year
- Extract video features. Currently, the models include I3D; more will be added over time.☆12Jun 4, 2020Updated 6 years ago
- ☆16Mar 17, 2025Updated last year
- A minimalist deep learning framework written in imitation of TensorFlow, for practice purposes only☆14Sep 11, 2018Updated 7 years ago
- Towards Defending against Adversarial Examples via Attack-Invariant Features☆13Oct 12, 2023Updated 2 years ago
- ☆10Oct 18, 2024Updated last year
- [ICASSP 2025] Open-source code for the paper "Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification"☆64Oct 13, 2025Updated 8 months ago
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆22Jul 29, 2025Updated 11 months ago
- Affinity based segmentation algorithms and tools☆12May 12, 2025Updated last year
- [EMNLP25 Main] The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆25Mar 30, 2026Updated 3 months ago
- ACM MM 2022 - PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding☆11Aug 12, 2022Updated 3 years ago
- ☆16Sep 22, 2021Updated 4 years ago