[AAAI-25] Official repository of "Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection"
☆20 · Dec 27, 2024 · Updated last year
Alternatives and similar repositories for Prova
Users that are interested in Prova are comparing it to the libraries listed below.
- Official repository of "CoMP: Continual Multimodal Pre-training for Vision Foundation Models" ☆46 · Apr 3, 2025 · Updated last year
- [NeurIPS 2025] The official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tun… ☆40 · Feb 20, 2025 · Updated last year
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect… ☆84 · Jun 17, 2024 · Updated last year
- [CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding" ☆53 · Jun 16, 2025 · Updated 9 months ago
- UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation ☆128 · Updated this week
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment. ☆86 · May 4, 2025 · Updated 11 months ago
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection". ☆45 · Sep 12, 2024 · Updated last year
- Code for Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning ☆16 · Apr 5, 2024 · Updated 2 years ago
- ☆10 · Jul 5, 2024 · Updated last year
- [ICCV 2025] Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction ☆52 · Sep 22, 2025 · Updated 6 months ago
- Official repository of the paper "A Glimpse to Compress: Dynamic Visual Token Pruning for Large Vision-Language Models" ☆92 · Feb 13, 2026 · Updated last month
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs. ☆19 · Jun 27, 2024 · Updated last year
- The official implementation of dLLM-Var ☆32 · Nov 6, 2025 · Updated 5 months ago
- Official implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow" ☆32 · Apr 20, 2025 · Updated 11 months ago
- ☆13 · Apr 19, 2025 · Updated 11 months ago
- implementation of "Combining Sketch and Tone for Pencil Drawing Production" ☆16 · May 16, 2019 · Updated 6 years ago
- ☆11 · Jan 8, 2025 · Updated last year
- ☆16 · Jun 28, 2024 · Updated last year
- [CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception ☆23 · Jun 17, 2025 · Updated 9 months ago
- [ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation". ☆27 · Oct 13, 2024 · Updated last year
- ☆134 · Dec 22, 2023 · Updated 2 years ago
- Code for FOVEA: Foveated Image Magnification for Autonomous Navigation (ICCV 2021) ☆15 · Jul 13, 2022 · Updated 3 years ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models" ☆22 · Mar 26, 2025 · Updated last year
- Tells you what the point of each course is. ☆22 · May 16, 2014 · Updated 11 years ago
- ☆64 · Apr 2, 2026 · Updated last week
- POPGym Library in JAX ☆13 · Apr 15, 2024 · Updated last year
- [IROS 2024] Official code for Towards Dynamic and Small Objects Refinement for Unsupervised Domain Adaptative Nighttime Segmentation. ☆16 · Jun 4, 2024 · Updated last year
- (CVPR 2025 Highlight) Official repository of paper "AODRaw: Towards RAW Object Detection in Diverse Conditions" (https://arxiv.org/pdf/24… ☆24 · Apr 6, 2025 · Updated last year
- Delivery repo for users of Hauntimator tools ☆19 · Mar 15, 2026 · Updated 3 weeks ago
- Unified Language-driven Zero-shot Domain Adaptation (CVPR 2024) ☆17 · Nov 28, 2024 · Updated last year
- [ICCV 2025] Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation ☆24 · Jul 31, 2025 · Updated 8 months ago
- A lightweight Inference Engine built for block diffusion models ☆43 · Dec 9, 2025 · Updated 4 months ago
- [ICCV2025] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectors ☆26 · Jul 10, 2025 · Updated 8 months ago
- [ICCV 2025] UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions ☆38 · Feb 10, 2026 · Updated last month
- [ICCV'2025] LawDIS: Language-Window-based Controllable Dichotomous Image Segmentation ☆51 · Mar 8, 2026 · Updated last month
- ☆53 · Dec 23, 2024 · Updated last year
- Open-source red teaming framework for MLLMs with 42+ attack methods ☆241 · Mar 25, 2026 · Updated 2 weeks ago
- Implementation of ReRAW: RGB-to-RAW Image Reconstruction via Stratified Sampling for Efficient Object Detection on the Edge ☆29 · Sep 24, 2025 · Updated 6 months ago
- ☆16 · Sep 14, 2024 · Updated last year