LuFan31/CompreCap

CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

☆39

Alternatives and similar repositories for CompreCap

Users who are interested in CompreCap are comparing it to the repositories listed below.

LuFan31 / ET-OOD
CVPR2023: Uncertainty-Aware Optimal Transport for Semantically Coherent Out-of-Distribution Detection
☆26 · Mar 27, 2023 · Updated 3 years ago
VQAssessment / Q-Bench
An archived version of Q-Bench. We will make updates in https://github.com/q-future/Q-Bench in the future.
☆12 · Nov 16, 2023 · Updated 2 years ago
ant-research / DreamLIP
[ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions
☆138 · May 8, 2025 · Updated last year
IceTTTb / NopeSAC
[TPAMI'23] NOPE-SAC: Neural One-Plane RANSAC for Sparse-View Planar 3D Reconstruction
☆71 · Sep 10, 2023 · Updated 2 years ago
wuw2019 / R-AMT
☆20 · Oct 19, 2023 · Updated 2 years ago
franciszzj / HiLo
[ICCV 2023] HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation
☆38 · Jan 25, 2024 · Updated 2 years ago
felixcheng97 / AGAP
[3DV 2025] Learning Naturally Aggregated Appearance for Efficient 3D Editing
☆33 · Feb 13, 2025 · Updated last year
pranoyr / scene-graph-vit
Implementation of the Paper Scene-Graph ViT
☆10 · Dec 20, 2024 · Updated last year
GaryJiajia / TSG
[ACL 2023] Transforming Visual Scene Graphs to Image Captions
☆10 · Dec 13, 2023 · Updated 2 years ago
moshuilanting / fast-context-scene-graph-generation
Fast Contextual Scene Graph Generation with Unbiased Context Augmentation
☆11 · Aug 7, 2023 · Updated 2 years ago
chaimi2013 / FPCNet
Fully Point-wise Convolutional Neural Network
☆11 · Dec 30, 2019 · Updated 6 years ago
calmke / LiPMAP
[TPAMI 2026] Interacted Planes Reveal 3D Line Mapping
☆55 · Apr 17, 2026 · Updated 3 months ago
rafa-cxg / PySGG-cxg
☆14 · May 16, 2023 · Updated 3 years ago
SHTUPLUS / Pix2Grp_CVPR2024
☆71 · Nov 7, 2024 · Updated last year
qianmingduowan / Sat2Density
The official implementation of the paper: Sat2Density: Faithful Density Learning from Satellite-Ground Image Pairs (ICCV 2023)
☆59 · Feb 4, 2026 · Updated 5 months ago
lezhang7 / SAIL
[CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"
☆60 · Aug 15, 2025 · Updated 11 months ago
raghavlite / B3
☆43 · Jan 12, 2026 · Updated 6 months ago
Xiaohao-Xu / Ambiguity-in-Space
[ECCV 2026] One Scene, Two Depths: Probing Geometric Ambiguity in Monocular Foundation Models (Layered 3D Spatial Understanding)
☆23 · Jul 10, 2026 · Updated last week
lck666666 / plana3r
[NeurIPS 2025] the official project page of a paper, "PLANA3R: Zero-shot Metric Planar 3D Reconstruction via Feed-Forward Planar Splattin…
☆74 · May 4, 2026 · Updated 2 months ago
chaimi2013 / NighttimeDehaze
Nighttime Haze Removal Based on a New Imaging Model
☆13 · Dec 30, 2019 · Updated 6 years ago
ant-research / scalelsd
[CVPR 2025] ScaleLSD: Scalable Deep Line Segment Detection Streamlined
☆63 · Sep 25, 2025 · Updated 9 months ago
wangqixun / mfpsg
mask2former psg
☆22 · Dec 12, 2022 · Updated 3 years ago
visinf / veto
Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023)
☆22 · Mar 23, 2026 · Updated 3 months ago
qishisuren123 / AnyCap
A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…
☆54 · Jul 24, 2025 · Updated 11 months ago
HKUST-LongGroup / RECODE
[NeurIPS 2023] Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models
☆23 · Oct 21, 2025 · Updated 9 months ago
ZhuGeKongKong / SGG-G2S
☆21 · Mar 1, 2022 · Updated 4 years ago
gpt4vision / OvSGTR
[ECCV 2024 Best Paper Candidate] Implementation of "Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Vi…
☆104 · Jul 27, 2025 · Updated 11 months ago
pritamqu / HALVA
[ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination
☆21 · Jan 27, 2025 · Updated last year
zkcys001 / CFD
Open-source strong baseline for domain generalization re-ID. We will update the strong baseline and CFD method~
☆10 · Nov 30, 2021 · Updated 4 years ago
sayaknag / unbiasedSGG
Official Pytorch Implementation of the framework TEMPURA proposed in our paper Unbiased Scene Graph Generation in Videos accepted by CVPR…
☆25 · Sep 9, 2025 · Updated 10 months ago
zhukaii / SPPR
Implementation of the paper "Self-Promoted Prototype Refinement for Few-Shot Class-Incremental Learning"
☆89 · Oct 28, 2022 · Updated 3 years ago
Kien085 / SG2Caps
☆23 · Aug 21, 2021 · Updated 4 years ago
zhangce01 / HiKER-SGG
[CVPR 2024] Code for HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation
☆77 · Oct 11, 2024 · Updated last year
zc-alexfan / render_mano_ih
☆12 · Dec 29, 2021 · Updated 4 years ago
Xianpeng919 / mvacon
☆11 · Jun 17, 2024 · Updated 2 years ago
harrytea / ROOT
ROOT: VLM based System for Indoor Scene Understanding and Beyond
☆42 · Jan 22, 2025 · Updated last year
donglaiw / AoT_Dataset
CVPR18: Learning and Using the Arrow of Time
☆40 · Feb 11, 2022 · Updated 4 years ago
HKUST-LongGroup / Relation-R1
[AAAI 2026] Relation-R1: Progressively Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relation Comprehension
☆20 · Mar 6, 2026 · Updated 4 months ago
zliucz / animate-your-word
[ICCV'25 Best Paper Candidate] Official Implementations for Paper: Dynamic Typography: Bringing Text to Life via Video Diffusion Prior
☆353 · Nov 11, 2025 · Updated 8 months ago