PKU-ICST-MIPL/Finedefics_ICLR2025

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PKU-ICST-MIPL/Finedefics_ICLR2025)

PKU-ICST-MIPL / Finedefics_ICLR2025

☆94

Alternatives and similar repositories for Finedefics_ICLR2025

Users that are interested in Finedefics_ICLR2025 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PKU-ICST-MIPL / TARA_CVPR2026
View on GitHub
☆17Mar 21, 2026Updated 4 months ago
PKU-ICST-MIPL / FineR1_ICLR2026
View on GitHub
☆68Apr 4, 2026Updated 3 months ago
ExplainableML / flair
View on GitHub
[CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations
☆148Mar 12, 2026Updated 4 months ago
Timsty1 / FineCLIP
View on GitHub
FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)
☆38Nov 12, 2025Updated 8 months ago
tiiuae / FineLIP
View on GitHub
code for FineLIP
☆43Nov 25, 2025Updated 8 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
HaiyangZheng / TextGCD
View on GitHub
(ECCV2024) Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery (TextGCD)
☆23Nov 26, 2025Updated 8 months ago
PKU-ICST-MIPL / MARS_TCSVT2021
View on GitHub
☆12Feb 2, 2023Updated 3 years ago
PKU-ICST-MIPL / DyFo_CVPR2025
View on GitHub
☆116Aug 14, 2025Updated 11 months ago
SarahRastegar / SelEx
View on GitHub
Official Repository of "SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery" (ECCV 2024)
☆31Aug 4, 2025Updated 11 months ago
ChenAnno / SPIRIT_TOMM2024
View on GitHub
Official implementation for "SPIRIT: Style-guided Patch Interaction for Fashion Image Retrieval with Text Feedback"
☆16Oct 27, 2025Updated 9 months ago
HaiyangZheng / PHE
View on GitHub
(NeurIPS2024) Prototypical Hash Encoding for On-the-Fly Fine-Grained Category Discovery (PHE)
☆16Oct 1, 2025Updated 9 months ago
Baichenjia / COPO
View on GitHub
Online Preference Alignment for Language Models via Count-based Exploration
☆21Jan 14, 2025Updated last year
ChenAnno / FashionERN_AAAI2024
View on GitHub
Official implementation for "FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval"
☆20Oct 27, 2025Updated 9 months ago
zjunlp / Deco
View on GitHub
[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation
☆147Sep 11, 2025Updated 10 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
XMUDeepLIT / UME-R1
View on GitHub
The code implementation for UME-R1: Exploring Reasoning-Driven Generative Multimodal Embeddings (ICLR 2026).
☆70Feb 25, 2026Updated 5 months ago
DTennant / Incremental-Generalized-Category-Discovery
View on GitHub
☆15Oct 27, 2023Updated 2 years ago
ChenAnno / Real20M_ACMMM2023
View on GitHub
Official implementation for "Real20M: A Large-scale E-commerce Dataset for Cross-domain Retrieval"
☆25Oct 27, 2025Updated 9 months ago
racinmat / GTAVisionExport-postprocessing
View on GitHub
☆11Jan 27, 2020Updated 6 years ago
shiming-chen / LaZSL
View on GitHub
Official implementations of our LaZSL (ICCV'25)
☆45Jul 13, 2025Updated last year
raghavlite / B3
View on GitHub
☆43Jan 12, 2026Updated 6 months ago
enguangW / GET
View on GitHub
GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery （CVPR2025）
☆37Mar 31, 2025Updated last year
OatmealLiu / FineR
View on GitHub
[ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models
☆189Jul 15, 2024Updated 2 years ago
NVlabs / QLIP
View on GitHub
[arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation
☆97Mar 1, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
inclusionAI / Zooming-without-Zooming
View on GitHub
[ICML 2026] ZwZ model family: SOTA fine-grained perception performace; ZoomBench: a new challenging perception benchmark
☆181May 4, 2026Updated 2 months ago
MikeWangWZHL / PAPO
View on GitHub
Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"
☆153Feb 4, 2026Updated 5 months ago
VisionOPD / Vision-OPD
View on GitHub
Vision-OPD is a regional-to-global on-policy self-distillation framework that transfers a model's own privileged crop-conditioned percept…
☆222Jul 17, 2026Updated last week
PKU-ICST-MIPL / Venus_CVPR2026
View on GitHub
☆145Mar 10, 2026Updated 4 months ago
Event-AHU / Neuromorphic_ReID
View on GitHub
[AAAI 2026] Towards high-level person/vehicle re-id using an event camera
☆18Jul 10, 2026Updated 2 weeks ago
XMUDeepLIT / LLaVE
View on GitHub
LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning
☆78May 23, 2025Updated last year
SSDUT-Caiyq / UFG-NCD
View on GitHub
(CVPR2024 Highlight) Novel Class Discovery for Ultra-Fine-Grained Visual Categorization (UFG-NCD)
☆24Jul 1, 2024Updated 2 years ago
xjjxmu / TextRefiner
View on GitHub
The official code for "TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning" | [AAAI2025]
☆53Mar 13, 2025Updated last year
MPI-Lab / MLLM4Text-ReID
View on GitHub
Code for Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID (CVPR 2024)
☆91Jul 13, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
360CVGroup / Inner-Adaptor-Architecture
View on GitHub
LMM solved catastrophic forgetting, AAAI2025
☆45Apr 15, 2025Updated last year
shufangxun / LLaVA-MoD
View on GitHub
[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation
☆227Mar 31, 2025Updated last year
yu-rp / VisualPerceptionToken
View on GitHub
☆136Mar 22, 2025Updated last year
pritamqu / HALVA
View on GitHub
[ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination
☆21Jan 27, 2025Updated last year
Liuziyu77 / Visual-RFT
View on GitHub
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’
☆2,262Oct 29, 2025Updated 9 months ago
chs20 / fuselip
View on GitHub
FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens
☆17Sep 8, 2025Updated 10 months ago
MCG-NJU / FreeRet
View on GitHub
[ICML2026] FreeRet: MLLMs as Training-Free Retrievers
☆22May 25, 2026Updated 2 months ago