Row11n/Prova

[AAAI-25] Official repository of "Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection"

☆20

Alternatives and similar repositories for Prova

Users who are interested in Prova compare it with the repositories listed below.

ShareLab-SII / CoMP-MM
Official repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models"
☆48 · Apr 3, 2025 · Updated last year
wdrink / OpenTokenizer
☆21 · Jan 17, 2025 · Updated last year
inst-it / inst-it
[NeurIPS 2025] The official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tun…
☆40 · Feb 20, 2025 · Updated last year
ShareLab-SII / UniAR
[ICML 2026] The official implementation of paper "Unified Multimodal Autoregressive Modeling with Shared Context—Visual Tokenizer is Key …
☆46 · Jul 13, 2026 · Updated last week
HenryYu23 / DAS
☆12 · Mar 13, 2025 · Updated last year
wjpoom / SPEC
[CVPR 2024] The official implementation of the paper "Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding"
☆52 · Jun 16, 2025 · Updated last year
MengLcool / DeepStack-VL
[NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…
☆93 · Jun 17, 2024 · Updated 2 years ago
wdrink / ARM
ARM: An AutoRegressive Large Multimodal Model with Discrete Representations
☆50 · Jun 10, 2026 · Updated last month
mlvlab / RALF
Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".
☆47 · Sep 12, 2024 · Updated last year
TencentYoutuResearch / SPEAR
An RL Recipe for Building Agentic LLMs via Self-Imitation on Long-Horizon Agentic Tasks
☆39 · Jan 30, 2026 · Updated 5 months ago
HanSolo9682 / CounterCurate
This is the implementation of CounterCurate, the data curation pipeline for both physical and semantic counterfactual image-caption pairs.
☆19 · Jun 27, 2024 · Updated 2 years ago
MengLcool / Magic-Pencil
Implementation of "Combining Sketch and Tone for Pencil Drawing Production"
☆16 · May 16, 2019 · Updated 7 years ago
HVision-NKU / ControlSR
☆13 · Apr 19, 2025 · Updated last year
FishAndWasabi / Real-LOD
Official implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"
☆34 · Apr 20, 2025 · Updated last year
jessemelpolio / AnytimeCL
[ECCV'24 Oral] Anytime Continual Learning for Open Vocabulary Classification
☆24 · Oct 17, 2024 · Updated last year
roywang021 / EOD
Code for AAAI2024 paper: Towards Evidential and Class Separable Open Set Object Detection
☆12 · Dec 23, 2023 · Updated 2 years ago
Jiaxing-star / LLaVA-Octopus
☆11 · Jan 8, 2025 · Updated last year
mecarill / classawareteacher
☆17 · Jun 28, 2024 · Updated 2 years ago
VIML-CVDL / Object-Detection-in-Foggy-Scenes
☆12 · Dec 21, 2022 · Updated 3 years ago
MengLcool / SEGIC
[ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".
☆27 · Oct 13, 2024 · Updated last year
X2FD / LVIS-INSTRUCT4V
☆134 · Dec 22, 2023 · Updated 2 years ago
tchittesh / fovea
Code for FOVEA: Foveated Image Magnification for Autonomous Navigation (ICCV 2021)
☆15 · Jul 13, 2022 · Updated 4 years ago
sinwang20 / D2PO
[ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1…
☆18 · Jul 22, 2025 · Updated 11 months ago
kaist-ami / BEAF
[ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models"
☆22 · Mar 26, 2025 · Updated last year
gaoyingjay / PS-TTL
This is the implementation of the paper “PS-TTL: Prototype-based Soft-labels and Test-Time Learning for Few-shot Object Detection” (MM 20…
☆30 · Jan 30, 2026 · Updated 5 months ago
FLAIROx / popjym
POPGym Library in JAX
☆14 · Apr 15, 2024 · Updated 2 years ago
lzyhha / AODRaw-mmdetection
(CVPR 2025 Highlight) Official repository of paper "AODRaw: Towards RAW Object Detection in Diverse Conditions" (https://arxiv.org/pdf/24…
☆24 · Apr 6, 2025 · Updated last year
HVision-NKU / GlimpsePrune
[TCSVT] Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models"
☆98 · Jun 12, 2026 · Updated last month
yuhangzang / OV-DETR
[Under preparation] Code repo for "Open-Vocabulary DETR with Conditional Matching" (ECCV 2022)
☆240 · Aug 3, 2022 · Updated 3 years ago
viiika / Prism
[ICML 2026] Official Implementation of Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diff…
☆21 · Mar 4, 2026 · Updated 4 months ago
DongSky / MR-GDINO
☆54 · Dec 23, 2024 · Updated last year
julian-8897 / hyperbolic-latent-vae
Variational Autoencoder with non-euclidean (hyperbolic) latent space
☆14 · Nov 25, 2022 · Updated 3 years ago
nis-research / afa-augment
☆24 · Nov 11, 2024 · Updated last year
vita-epfl / butterflydetector
☆10 · Apr 20, 2023 · Updated 3 years ago
QizaoWang / CAMC-CCReID
Co-Attention Aligned Mutual Cross-Attention for Cloth-Changing Person Re-Identification [ACCV 2022 Oral]
☆17 · Dec 26, 2024 · Updated last year
xiaoshideta / MixPrompt
(NeurIPS 2025) MixPrompt: Efficient Mixed Prompting for Multimodal Semantic Segmentation
☆17 · Mar 12, 2026 · Updated 4 months ago
HVision-NKU / TempSamp-R1
[Official, NeurIPS 2025] TempSamp-R1: Effective Temporal Sampling with Reinforcement Fine-Tuning for Video LLMs.
☆17 · Jun 8, 2026 · Updated last month
ChenHsing / VIDiff
☆39 · Dec 4, 2023 · Updated 2 years ago
hhhyyeee / Hybrid-TTA
[ICCV 2025] Hybrid-TTA: Continual Test-time Adaptation via Dynamic Domain Shift Detection
☆16 · Jan 14, 2026 · Updated 6 months ago