mlvlab / SpeaQ
Official PyTorch implementation of "Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection" (CVPR 2024).
☆24 Updated 5 months ago
Related projects:
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation". ☆12 Updated 5 months ago
- Code for paper 'Leveraging Predicate and Triplet Learning for Scene Graph Generation'. (CVPR 2024) ☆22 Updated 2 weeks ago
- VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation ☆13 Updated 3 months ago
- [CVPR 2024 Best paper award candidate] EGTR: Extracting Graph from Transformer for Scene Graph Generation ☆55 Updated 2 months ago
- state-of-the-art open vocabulary detector on COCO/LVIS/V3Det ☆23 Updated 5 months ago
- This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding" ☆18 Updated last year
- This is a repository for listing papers on scene graph generation and application. ☆20 Updated last month
- ☆19 Updated last year
- Implementation of "Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retent… ☆11 Updated last week
- ☆32 Updated 5 months ago
- Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer". ☆32 Updated 4 months ago
- The official implementation of JM3D. ☆27 Updated 11 months ago
- Scene Graph Generate Zero Shot ☆17 Updated last year
- The official code for Devil's on the Edges: Selective Quad Attention for Scene Graph Generation, CVPR2023. ☆22 Updated last year
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20… ☆18 Updated 4 months ago
- DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation ☆7 Updated 2 months ago
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision ☆10 Updated last year
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models ☆13 Updated 2 months ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning ☆35 Updated 11 months ago
- [ICCV 2023] HiLo: Exploiting High Low Frequency Relations for Unbiased Panoptic Scene Graph Generation ☆28 Updated 7 months ago
- Open-vocabulary Video Instance Segmentation Codebase built upon Detectron2, which is really easy to use. ☆15 Updated 6 months ago
- (ICCV 2023) Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation ☆43 Updated 2 months ago
- Large-Vocabulary Video Instance Segmentation dataset ☆73 Updated 2 months ago
- CVPR 2023 Accepted Paper HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models ☆52 Updated 6 months ago
- [ECCV 2024] ControlCap: Controllable Region-level Captioning ☆49 Updated last month
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio… ☆22 Updated 10 months ago
- ☆20 Updated 2 weeks ago
- Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023) ☆21 Updated 11 months ago
- The official code for Relational Context Learning for Human-Object Interaction Detection, CVPR2023. ☆48 Updated last year
- ☆21 Updated last year