JieyuZ2/ProVision

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JieyuZ2/ProVision)

JieyuZ2 / ProVision

A instruction data generation system for multimodal language models.

☆37

Alternatives and similar repositories for ProVision

Users that are interested in ProVision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JieyuZ2 / TaskMeAnything
View on GitHub
[NeurIPS 2024] A task generation and model evaluation system for multimodal language models.
☆71Nov 27, 2024Updated last year
jkli1998 / T-CAR
View on GitHub
Code for paper 'Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction' （TOMM 2023）
☆10Sep 6, 2025Updated 10 months ago
SalesforceAIResearch / LATTE
View on GitHub
☆70Jun 2, 2026Updated last month
RAIVNLab / mnms
View on GitHub
m&ms: A Benchmark to Evaluate Tool-Use for multi-step multi-modal tasks
☆46Sep 26, 2024Updated last year
limenlp / ExeVRM
View on GitHub
Official implementation for the paper "Video-Based Reward Modeling for Computer-Use Agents"
☆17Mar 14, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
salesforce / QVR-SimpleDLM
View on GitHub
Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.
☆16May 1, 2025Updated last year
RAIVNLab / sugar-crepe
View on GitHub
[NeurIPS 2023] A faithful benchmark for vision-language compositionality
☆93Feb 13, 2024Updated 2 years ago
limenlp / SEA
View on GitHub
Official Implementation for the paper "Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base"
☆27Sep 2, 2025Updated 10 months ago
Zhaoyang-Chu / code-unlearning
View on GitHub
This repository contains a PyTorch implementation of the ICSE'26 paper "Scrub It Out! Erasing Sensitive Memorization in Code Language Mod…
☆30Sep 18, 2025Updated 10 months ago
JackHck / SBCL
View on GitHub
[ICCV 2023] Subclass-balancing contrastive learning for long-tailed recognition
☆18Oct 30, 2023Updated 2 years ago
JackHck / MADAug
View on GitHub
[ICCV 2023] MADAug: When to Learn What: Model-Adaptive Data Augmentation Curriculum
☆20Nov 9, 2023Updated 2 years ago
jamespark3922 / SyntheticVG
View on GitHub
☆29Jun 12, 2025Updated last year
salesforce / PB-OVD
View on GitHub
A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels
☆65Jun 25, 2026Updated 3 weeks ago
weikaih04 / Synthetic-Detection-Segmentation-Grounding-Data
View on GitHub
[CVPR 2026] An accurate and dense-annotated synthetic dataset for training SOTA detectors / segmentors / Grounding-VLMs.
☆49Feb 23, 2026Updated 5 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
findalexli / mllm-dpo
View on GitHub
[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model
☆48Nov 10, 2024Updated last year
samschulter / omnilabeltools
View on GitHub
A Python toolkit for the OmniLabel benchmark providing code for evaluation and visualization
☆23Feb 1, 2025Updated last year
zzxslp / SoM-LLaVA
View on GitHub
[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs
☆145Aug 23, 2024Updated last year
VidCapBench / VidCapBench
View on GitHub
☆13May 17, 2025Updated last year
rlqja1107 / torch-ST-SGG
View on GitHub
Official PyTorch implementation Source code for Adaptive Self-Training Framework for Fine-grained Scene Graph generation (ST-SGG), accept…
☆22Jan 30, 2024Updated 2 years ago
kongdai123 / consistency2
View on GitHub
☆16Jun 14, 2024Updated 2 years ago
evendrow / face-reconstruction
View on GitHub
[Paper] Repository for “Realistic Face Reconstruction from Deep Embeddings," published in NeurIPS PriML 2021.
☆24Nov 16, 2022Updated 3 years ago
Yushi-Hu / tifa
View on GitHub
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
☆186Apr 29, 2024Updated 2 years ago
IIGROUP / PUM
View on GitHub
[CVPR 2021] Pytorch implementation for Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
☆19May 7, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
TrustGen / TrustEval-toolkit
View on GitHub
[ICLR'26, NAACL'25 Demo] Toolkit & Benchmark for evaluating the trustworthiness of generative foundation models.
☆132Aug 22, 2025Updated 11 months ago
HKUST-LongGroup / Relation-R1
View on GitHub
[AAAI 2026] Relation-R1: Progressively Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relation Comprehension
☆20Mar 6, 2026Updated 4 months ago
xiaoboxia / CoDis
View on GitHub
ICCV'2023: Combating Noisy Labels with Sample Selection by Mining High-Discrepancy Examples
☆12Oct 16, 2023Updated 2 years ago
simplelifetime / TIVE
View on GitHub
Less is More: High-value Data Selection for Visual Instruction Tuning
☆20Jan 18, 2025Updated last year
michaelofengenden / PPTArena
View on GitHub
Benchmark for Agentic Powerpoint Editing Tasks
☆21Jul 6, 2026Updated 2 weeks ago
dbolya / parc
View on GitHub
A benchmark suite for Scalable Diverse Model Selection for Accessible Transfer Learning from our NeurIPS 2021 paper.
☆15Dec 14, 2022Updated 3 years ago
aimagelab / CoDE
View on GitHub
[ECCV'24] Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities
☆52Jul 2, 2025Updated last year
ruc-datalab / SC-prompt
View on GitHub
☆12May 13, 2023Updated 3 years ago
allenai / molmoweb
View on GitHub
☆579Jun 26, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TIGER-AI-Lab / Mantis
View on GitHub
Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024 Best Paper]
☆239Jan 3, 2026Updated 6 months ago
zlab-princeton-internal / writing-guide
View on GitHub
Paper writing guide for Zhuang Liu Lab @ Princeton University
☆16Jun 24, 2026Updated 3 weeks ago
LinxinS97 / NLPBench
View on GitHub
NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models
☆10Oct 27, 2023Updated 2 years ago
agents-x-project / TIR-Bench
View on GitHub
[ECCV 2026] Official implementation of "TIR-Bench: A Comprehensive Benchmark for Agentic Thinking-with-Images Reasoning"
☆25Feb 8, 2026Updated 5 months ago
wyf23187 / Adaptive_Distractions
View on GitHub
NeurIPS 2025 Poster
☆26Feb 4, 2025Updated last year
awslabs / aws-cv-unique-information
View on GitHub
We define and estimate smooth unique information of samples with respect to classifier weights and predictions. We compute these quantiti…
☆11Mar 9, 2021Updated 5 years ago
maqqbu / MMSR
View on GitHub
The code for NeurIPS 2020 paper: Adversarial Crowdsourcing Through Robust Rank-One Matrix Completion.
☆10Oct 26, 2020Updated 5 years ago