SivanDoveh/IPLoc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/SivanDoveh/IPLoc)

SivanDoveh / IPLoc

Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples

☆40

Alternatives and similar repositories for IPLoc

Users that are interested in IPLoc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HashmatShadab / HSAT
View on GitHub
[MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology
☆12Jun 17, 2025Updated last year
fahadshamshad / deep-facial-privacy-prior
View on GitHub
[ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".
☆12Oct 11, 2024Updated last year
HashmatShadab / MambaRobustness
View on GitHub
[CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"
☆26Jun 8, 2025Updated last year
danielchyeh / this-is-my
View on GitHub
Official This-Is-My Dataset published in CVPR 2023
☆16Jul 18, 2024Updated 2 years ago
akhtarvision / bpc_calibration
View on GitHub
[CVPR 2023] Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection
☆31Jun 21, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Space3D-Bench / Space3D-Bench
View on GitHub
☆12Apr 18, 2025Updated last year
mzeeshankaramat / SafeAgents
View on GitHub
☆20Jun 4, 2026Updated last month
snap-research / MyVLM
View on GitHub
Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)
☆188Jul 5, 2024Updated 2 years ago
zzzx1224 / Beyond-model-adaptation-at-test-time-Papers
View on GitHub
☆51Nov 7, 2024Updated last year
akhtarvision / weather-regional
View on GitHub
☆11Oct 29, 2024Updated last year
GaryJiajia / OFv2_ICL_VQA
View on GitHub
[CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering
☆21May 28, 2025Updated last year
dvirsamuel / PDM
View on GitHub
Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".
☆14Feb 26, 2025Updated last year
k1rezaei / Text-to-concept
View on GitHub
☆36Feb 5, 2024Updated 2 years ago
hananshafi / MedContext
View on GitHub
[MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"
☆14Nov 1, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Muhammad-Ibraheem-Siddiqui / PerSense
View on GitHub
[BMVC 2025] Official Implementation of the paper "PerSense: Personalized Instance Segmentation in Dense Images"
☆31Dec 18, 2025Updated 7 months ago
CUHK-AIM-Group / CLIFF
View on GitHub
[ECCV' 24 Oral] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection
☆31Sep 26, 2024Updated last year
Stevetich / EventHallusion
View on GitHub
EventHallusion: Diagnosing Event Hallucinations in Video LLMs
☆34Aug 5, 2025Updated 11 months ago
NUS-HPC-AI-Lab / Multimodal-ICL-Retriever
View on GitHub
☆10Nov 12, 2024Updated last year
mbzuai-oryx / CVRR-Evaluation-Suite
View on GitHub
[CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo…
☆50Aug 23, 2024Updated last year
GasolSun36 / SURf
View on GitHub
[EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information
☆11Oct 11, 2024Updated last year
hananshafi / MTL-ViT
View on GitHub
A new multi-task learning framework using Vision Transformers
☆11Jun 19, 2024Updated 2 years ago
rohit901 / VANE-Bench
View on GitHub
[NAACL'25] Contains code and documentation for our VANE-Bench paper.
☆24Aug 19, 2025Updated 11 months ago
ChengHan111 / VPT-or-FT
View on GitHub
Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)
☆13Mar 8, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
muzairkhattak / transformers-transforming-vision
View on GitHub
Validating image classification benchmark results on ViTs and ResNets (v2)
☆13Nov 3, 2022Updated 3 years ago
OmkarThawakar / STT-UNET
View on GitHub
3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers
☆14Apr 17, 2023Updated 3 years ago
amandpkr / GMNR
View on GitHub
(ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.
☆18Sep 28, 2023Updated 2 years ago
gefend / LIMITR
View on GitHub
Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representation
☆17Updated this week
OpenGVLab / LLMPrune-BESA
View on GitHub
BESA is a differentiable weight pruning technique for large language models.
☆17Mar 4, 2024Updated 2 years ago
FreedomIntelligence / TRIM
View on GitHub
We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…
☆22Jan 11, 2026Updated 6 months ago
mbzuai-oryx / ALM-Bench
View on GitHub
[CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…
☆47May 26, 2025Updated last year
koushiksrivats / robust-concept-erasing
View on GitHub
[⭐ CVPR 2025 Highlight ⭐] Official Implementation of the paper STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing fro…
☆31Apr 22, 2025Updated last year
techmn / cosnet
View on GitHub
A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes (WACV 2025)
☆12Aug 11, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Shelton1013 / Chain_of_Attack
View on GitHub
[CVPR'25]Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks
☆32Jun 12, 2025Updated last year
iabh1shekbasu / CalibPrompt
View on GitHub
[BMVC 2025 🔥] CalibPrompt is the first framework that enhances Med-VLM calibration during prompt tuning.
☆16Jul 13, 2026Updated last week
Muhammad-Huzaifaa / ObjectCompose
View on GitHub
[ACCV 2024] ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes 🚀🚀🚀
☆37Jan 21, 2025Updated last year
ffhibnese / CGNC_Targeted_Adversarial_Attacks
View on GitHub
[ECCV-2024] Transferable Targeted Adversarial Attack, CLIP models, Generative adversarial network, Multi-target attacks
☆39Apr 23, 2025Updated last year
ilkerkesen / ViLMA
View on GitHub
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)
☆16Jan 18, 2024Updated 2 years ago
zysxmu / DFSQ
View on GitHub
super-resolution; post-training quantization; model compression
☆14Nov 10, 2023Updated 2 years ago
WisconsinAIVision / YoLLaVA
View on GitHub
🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant (NeurIPS 2024)
☆123Mar 26, 2025Updated last year