QUVA-Lab/PIN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/QUVA-Lab/PIN)

QUVA-Lab / PIN

Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs

☆26

Alternatives and similar repositories for PIN

Users that are interested in PIN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jkli1998 / T-CAR
View on GitHub
Code for paper 'Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction' （TOMM 2023）
☆10Sep 6, 2025Updated 10 months ago
ivonajdenkoska / tulip
View on GitHub
[ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"
☆32Jan 26, 2026Updated 5 months ago
SarahRastegar / InfoSieve
View on GitHub
Official Repository of "Learn to Categorize or Categorize to Learn? Self-Coding for Generalized Category Discovery" (NeurIPS 2023)
☆23Aug 4, 2025Updated 11 months ago
SarahRastegar / Open-World-Papers
View on GitHub
A comprehensive collection of open world papers from top tier conferences and journals
☆25Dec 27, 2024Updated last year
SarahRastegar / SelEx
View on GitHub
Official Repository of "SelEx: Self-Expertise in Fine-Grained Generalized Category Discovery" (ECCV 2024)
☆31Aug 4, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WalterSimoncini / fungivision
View on GitHub
Library implementation of "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"
☆40Oct 31, 2024Updated last year
jialuli-luka / Video-MSG
View on GitHub
Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization
☆28Apr 14, 2025Updated last year
lxasqjc / MCPL
View on GitHub
MCPL: MULTI-CONCEPT PROMPT LEARNING
☆20May 27, 2024Updated 2 years ago
ImKeTT / ZeroGen
View on GitHub
[NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation
☆14Oct 7, 2023Updated 2 years ago
mdivyanshu97 / DISCOVR
View on GitHub
☆15Nov 20, 2025Updated 8 months ago
FrankFundel / SGCond
View on GitHub
☆10Jun 28, 2023Updated 3 years ago
mightyzau / InfMLLM
View on GitHub
☆19Dec 6, 2023Updated 2 years ago
causalNLP / amr_llm
View on GitHub
This repo explores how AMR to address tasks difficult for LLMs
☆13Jan 15, 2024Updated 2 years ago
Surrey-UP-Lab / AV-GS
View on GitHub
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis
☆14Oct 3, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
liujiyuan13 / MVTecAD
View on GitHub
A Pytorch loader for MVTecAD dataset.
☆11Dec 27, 2021Updated 4 years ago
jkli1998 / DRM
View on GitHub
Code for paper 'Leveraging Predicate and Triplet Learning for Scene Graph Generation'. (CVPR 2024)
☆33Sep 6, 2025Updated 10 months ago
shgaurav1 / DVG
View on GitHub
Diverse Video Generation using a Gaussian Process Trigger
☆18Dec 13, 2022Updated 3 years ago
showlab / MovieSeq
View on GitHub
[ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences
☆46Mar 11, 2025Updated last year
forgi86 / lru-reduction
View on GitHub
Python code of the paper Model order reduction of deep structured state-space models: A system-theoretic approach
☆14Nov 22, 2024Updated last year
geoalgo / syne-tune
View on GitHub
Optimizing Hyperparameters with Conformal Quantile Regression
☆11May 22, 2023Updated 3 years ago
cloneofsimo / repa-rf
View on GitHub
☆32Nov 4, 2024Updated last year
ethanlshen / HierNet
View on GitHub
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…
☆23Nov 8, 2023Updated 2 years ago
masoudpz / AVID-Adversarial-Visual-Irregularity-Detection
View on GitHub
AVID: Adversarial Visual Irregularity Detection
☆12Oct 27, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
anguyen8 / vision-llms-are-blind
View on GitHub
☆143May 25, 2026Updated last month
VincentDENGP / 3D-LR
View on GitHub
Can 3D Vision-Language Models Truly Understand Natural Language?
☆20Mar 28, 2024Updated 2 years ago
zeyofu / Commonsense-T2I
View on GitHub
Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]
☆24Aug 13, 2024Updated last year
ExplainableML / ImageFreeZSL
View on GitHub
☆18Oct 5, 2024Updated last year
msmathcomp / hyperbolic-tsne
View on GitHub
Experiments and content for the "Accelerating hyperbolic t-SNE" paper.
☆19Apr 30, 2026Updated 2 months ago
Qinying-Liu / TagAlign
View on GitHub
Official implementation of TagAlign
☆37Dec 11, 2024Updated last year
Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking
View on GitHub
[CVPR 2024] Improving language-visual pretraining efficiency by perform cluster-based masking on images.
☆33May 16, 2024Updated 2 years ago
BerasiDavide / vlm_image_compositionality
View on GitHub
[CVPR'25] Official implementation of the paper "Not Only Text: Exploring Compositionality of Visual Representations in Vision-Language Mo…
☆18Nov 21, 2025Updated 8 months ago
junha1125 / Domain-Adaptation-Generalization-in-ECCV-2024
View on GitHub
☆16Sep 29, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AlbertiPot / nar
View on GitHub
codes for Neural Architecture Ranker and detailed cell information datasets based on NAS-Bench series
☆12Jul 11, 2022Updated 4 years ago
salgadev / private_nlp
View on GitHub
Natural Language Processing models using private and secure data. Powered by OpenMined's tools PySyft and SyferText.
☆11Feb 11, 2021Updated 5 years ago
mvp-ai-lab / FreeScale
View on GitHub
The official implementation of our CVPR 2026 paper: "FreeScale: Scaling 3D Scenes via Certainty-Aware Free-View Generation"
☆20May 17, 2026Updated 2 months ago
baaaad / ECE
View on GitHub
[ECCV'22 Poster] Explicit Image Caption Editing
☆22Nov 30, 2022Updated 3 years ago
AdaptiveMotorControlLab / AROS
View on GitHub
💍
☆26Feb 3, 2025Updated last year
xizaoqu / MOFT
View on GitHub
[Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller
☆51Aug 5, 2025Updated 11 months ago
JHU-CLSP / turking-bench
View on GitHub
Web-grounded natural language instructions
☆18Nov 25, 2024Updated last year