☆93Mar 20, 2026Updated 2 months ago
Alternatives and similar repositories for Finedefics_ICLR2025
Users that are interested in Finedefics_ICLR2025 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A comprehensive collection of open world papers from top tier conferences and journals☆25Dec 27, 2024Updated last year
- ☆12Feb 2, 2023Updated 3 years ago
- We present **FOCI**, a benchmark for Fine-grained Object ClassIfication for large vision language models (LVLMs).☆19Jun 21, 2024Updated last year
- (ECCV2024) Textual Knowledge Matters: Cross-Modality Co-Teaching for Generalized Visual Class Discovery (TextGCD)☆23Nov 26, 2025Updated 6 months ago
- Official implementation for "FashionERN: Enhance-and-Refine Network for Composed Fashion Image Retrieval"☆20Oct 27, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆15Oct 27, 2023Updated 2 years ago
- ☆11Jan 27, 2020Updated 6 years ago
- FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens☆17Sep 8, 2025Updated 9 months ago
- ☆17May 2, 2024Updated 2 years ago
- The official repo for the DanQing dataset.☆36Mar 25, 2026Updated 2 months ago
- ☆27Oct 11, 2024Updated last year
- [IJCV 2025] The official implementation of "AnyPattern: Towards In-context Image Copy Detection"☆11Oct 24, 2025Updated 7 months ago
- [ICLR 2026] SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models☆96Jun 9, 2026Updated last week
- 自动调制识别(AMR)☆20Nov 16, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- FineCLIP: Self-distilled Region-based CLIP for Better Fine-grained Understanding (NIPS24)☆38Nov 12, 2025Updated 7 months ago
- GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery (CVPR2025)☆36Mar 31, 2025Updated last year
- [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations☆142Mar 12, 2026Updated 3 months ago
- XCon: Learning with Experts for Fine-grained Category Discovery☆19Dec 19, 2022Updated 3 years ago
- Distribution-Aware Binarization of Neural Networks for Sketch Recognition - WACV 18☆11Jul 5, 2019Updated 6 years ago
- LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning☆78May 23, 2025Updated last year
- LMM solved catastrophic forgetting, AAAI2025☆45Apr 15, 2025Updated last year
- ☆35Apr 9, 2025Updated last year
- The implementation of our NeurIPS 2024 paper "DarkSAM: Fooling Segment Anything Model to Segment Nothing".☆14Nov 4, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models☆15Nov 1, 2024Updated last year
- [NeurIPS2025 Spotlight 🔥 ] Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Langu…☆275Nov 5, 2025Updated 7 months ago
- Official repository for "LFR-GAN: Local Feature Refinement based Generative Adversarial Network for Text-to-Image Generation" (TOMM 2023)…☆11Mar 21, 2023Updated 3 years ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆146Sep 11, 2025Updated 9 months ago
- Source code of our TCSVT 2020 paper "Multi-level Knowledge Injecting for Visual Commonsense Reasoning"☆11Sep 18, 2024Updated last year
- The official implementation of RAR☆91Dec 9, 2025Updated 6 months ago
- CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning☆38Mar 21, 2025Updated last year
- Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’☆2,247Oct 29, 2025Updated 7 months ago
- ☆11May 6, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Oct 25, 2024Updated last year
- ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs☆29Aug 15, 2025Updated 10 months ago
- Tis is code for Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model (ACM MM 2024))☆12Aug 27, 2024Updated last year
- This is a repository for organizing codes related to re-identification (especially state-of-the-art reid methods).☆11Sep 15, 2021Updated 4 years ago
- code for paper: Simultaneous Image to Zero and Zero to Noise: Diffusion Models with Analytical Image Attenuation☆63May 7, 2026Updated last month
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆80May 31, 2025Updated last year
- MNIST files in PNG format☆18May 29, 2022Updated 4 years ago