CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
☆38Mar 21, 2025Updated 11 months ago
Alternatives and similar repositories for CompreCap
Users that are interested in CompreCap are comparing it to the libraries listed below
Sorting:
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Mar 16, 2025Updated 11 months ago
- [NeurIPS 2024] Official PyTorch implementation of LoTLIP: Improving Language-Image Pre-training for Long Text Understanding☆50Jan 14, 2025Updated last year
- [ICCV 2023] HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation☆37Jan 25, 2024Updated 2 years ago
- NeuSyRE: A Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment☆22Mar 10, 2024Updated last year
- ☆65Nov 7, 2024Updated last year
- Fast Contextual Scene Graph Generation with Unbiased Context Augmentation☆12Aug 7, 2023Updated 2 years ago
- [ACL 2023] Transforming Visual Scene Graphs to Image Captions☆10Dec 13, 2023Updated 2 years ago
- Implementation of the Paper Scene-Graph ViT☆10Dec 20, 2024Updated last year
- ☆24Jun 12, 2025Updated 8 months ago
- Removing Cost Volumes from Optical Flow Estimators (ICCV 2025 Oral)☆33Dec 2, 2025Updated 2 months ago
- Official PyTorch implementation of the paper Transformer-Based Image Generation from Scene Graphs https://arxiv.org/abs/2303.04634☆19Jan 30, 2024Updated 2 years ago
- ☆14May 16, 2023Updated 2 years ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination☆19Jan 27, 2025Updated last year
- ☆18Apr 20, 2025Updated 10 months ago
- [WACV 2025] Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge☆38Oct 29, 2024Updated last year
- [ECCV 2024] Official PyTorch implementation of DreamLIP: Language-Image Pre-training with Long Captions☆138May 8, 2025Updated 9 months ago
- ☆16Jun 11, 2021Updated 4 years ago
- [CVPR 2025 Highlight] Official Pytorch codebase for paper: "Assessing and Learning Alignment of Unimodal Vision and Language Models"☆57Aug 15, 2025Updated 6 months ago
- [3DV 2025] Learning Naturally Aggregated Appearance for Efficient 3D Editing