code for FineLIP
☆40Nov 25, 2025Updated 4 months ago
Alternatives and similar repositories for FineLIP
Users that are interested in FineLIP are comparing it to the libraries listed below.
- The official code implementation of Generalized Category Discovery in Semantic Segmentation☆18Dec 20, 2023Updated 2 years ago
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆138Mar 12, 2026Updated last month
- ☆16Jan 30, 2024Updated 2 years ago
- [ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance☆17Sep 11, 2024Updated last year
- ☆18Jul 16, 2019Updated 6 years ago
- ☆11Oct 20, 2023Updated 2 years ago
- This is the official repo of OpenSatMap in NeurIPS 2024 D&B Track☆29Jul 6, 2025Updated 9 months ago
- ☆47Jun 22, 2024Updated last year
- ☆35Oct 1, 2024Updated last year
- [CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".☆32May 12, 2025Updated 11 months ago
- ☆21Aug 8, 2024Updated last year
- ☆30Aug 14, 2023Updated 2 years ago
- Weakly Supervised Posture Mining with Reverse Cross-entropy for Fine-grained Classification☆27May 31, 2023Updated 2 years ago
- [ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings☆11Feb 24, 2025Updated last year
- ☆23May 8, 2025Updated 11 months ago
- GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding☆84May 10, 2025Updated 11 months ago
- Winning 3rd Place solution for HubMap - Hacking the Human Vasculature hosted on Kaggle☆14Aug 10, 2023Updated 2 years ago
- ☆38Mar 28, 2025Updated last year
- Official Implementation of Attentive Mask CLIP (ICCV2023, https://arxiv.org/abs/2212.08653)☆36May 29, 2024Updated last year
- VGI-Enhanced multimodal large language model for remote sensing images.☆187Mar 4, 2025Updated last year
- ☆39Jan 12, 2026Updated 3 months ago
- ☆25Mar 26, 2026Updated 2 weeks ago
- ☆37Jan 20, 2024Updated 2 years ago
- ☆12May 23, 2024Updated last year
- Learning Causal Alignment For Reliable Disease Diagnosis (ICLR 2025)☆16Jun 17, 2025Updated 9 months ago
- MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis☆17Jun 13, 2025Updated 10 months ago
- [CVPR 2025] Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation☆33Jul 18, 2025Updated 8 months ago
- ACMMM 2025☆17Dec 11, 2025Updated 4 months ago
- Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective☆15Oct 22, 2024Updated last year
- CausalDynamics: A large-scale benchmark for structural discovery of dynamical causal models☆31Oct 6, 2025Updated 6 months ago
- [CVPR 2025] This repository is intended to store the code and data for ASAP (Advancing Semantic Alignment Promotes Multi-Modal Manipulati…☆20Jun 18, 2025Updated 9 months ago
- This repository contains the implementation of the method described in our paper, "Divide and Conquer: Isolating Normal-Abnormal Attribut…☆11Apr 9, 2024Updated 2 years ago
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆13Jun 7, 2025Updated 10 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆99Mar 26, 2025Updated last year
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"☆33Oct 12, 2024Updated last year
- Extract video features. Currently, the models include I3D; more will be added over time.☆12Jun 4, 2020Updated 5 years ago
- ☆16Mar 17, 2025Updated last year
- A minimal deep learning framework written in imitation of TensorFlow, for practice purposes only☆14Sep 11, 2018Updated 7 years ago
- Towards Defending against Adversarial Examples via Attack-Invariant Features☆12Oct 12, 2023Updated 2 years ago