zhjohnchan/SK-VG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhjohnchan/SK-VG)

zhjohnchan / SK-VG

[CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.

☆34

Alternatives and similar repositories for SK-VG

Users that are interested in SK-VG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cv516Buaa / OV-VG
View on GitHub
☆31Mar 25, 2024Updated 2 years ago
HKUST-KnowComp / VD-PCR
View on GitHub
Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"
☆10Nov 1, 2022Updated 3 years ago
Rubics-Xuan / IVG
View on GitHub
This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with H…
☆15May 21, 2024Updated 2 years ago
daqingliu / awesome-rec
View on GitHub
A curated list of research papers in Referring Expression Comprehension (REC)
☆46May 13, 2021Updated 5 years ago
ChopinSharp / ref-nms
View on GitHub
Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"
☆22Dec 20, 2020Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
zyang-ur / ReSC
View on GitHub
Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020
☆90Sep 30, 2021Updated 4 years ago
hding2455 / DSC
View on GitHub
released code for CVPR2021: Deeply Shape-guided Cascade for Instance Segmentation
☆14Feb 20, 2022Updated 4 years ago
wh0330 / CAG_VisDial
View on GitHub
☆15Aug 13, 2020Updated 5 years ago
CCIIPLab / DPT
View on GitHub
The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering
☆20May 10, 2022Updated 4 years ago
ubc-vision / RefTR
View on GitHub
Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021
☆67May 26, 2022Updated 4 years ago
wangpengnorman / KB-Ref_dataset
View on GitHub
☆16Dec 28, 2020Updated 5 years ago
niejiahao1998 / MMRel
View on GitHub
☆31Nov 17, 2024Updated last year
MathLee / ICNet-for-RGBD-SOD
View on GitHub
[TIP2020] ICNet: Information Conversion Network for RGB-D Based Salient Object Detection
☆16Nov 17, 2023Updated 2 years ago
OpenGVLab / all-seeing
View on GitHub
[ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …
☆508Aug 9, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
TencentARC / TaCA
View on GitHub
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16Jun 20, 2023Updated 3 years ago
kingthreestones / RefCLIP
View on GitHub
☆39Jun 28, 2023Updated 3 years ago
CYVincent / Scene-Graph-Transformer-CogTree
View on GitHub
☆15Jun 11, 2021Updated 5 years ago
SijieSong / CVPR21-Cogrounding_semantic_attention
View on GitHub
☆14Jul 13, 2021Updated 5 years ago
gzaraunitn / autolabel
View on GitHub
☆21May 29, 2023Updated 3 years ago
chrisx599 / Video-Browser
View on GitHub
Official code repo of Video-Browser: Towards Agentic Open-web Video Browsing
☆28Jan 19, 2026Updated 6 months ago
zeakey / iccv2019-fmeasure
View on GitHub
Code accompanying the paper Optimizing the F-measure for Threshold-free Salient Object Detection.
☆29Aug 13, 2019Updated 6 years ago
NLP2CT / ua-cl-nmt
View on GitHub
Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)
☆11Jun 12, 2020Updated 6 years ago
aurooj / SHG-VQA
View on GitHub
Learning Situation Hyper-Graphs for Video Question Answering
☆23Feb 16, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
joslefaure / HERMES
View on GitHub
[ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics
☆37Sep 10, 2025Updated 10 months ago
yrcong / NODIS
View on GitHub
Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020
☆12Aug 28, 2020Updated 5 years ago
mlvlab / OVQA
View on GitHub
Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…
☆18Apr 23, 2024Updated 2 years ago
LeapLabTHU / Pseudo-Q
View on GitHub
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
☆153Jul 13, 2024Updated 2 years ago
yangli18 / VLTVG
View on GitHub
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆97Dec 2, 2022Updated 3 years ago
Sy-Zhang / TCMN-Release
View on GitHub
Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"
☆16Oct 22, 2022Updated 3 years ago
Paranioar / UniPT
View on GitHub
[CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory"
☆71Oct 15, 2024Updated last year
Mxbonn / ltmp
View on GitHub
Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…
☆17Nov 24, 2024Updated last year
fangruizhu / self_sup_semiVOS
View on GitHub
☆28Jul 1, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
CoinCheung / Deeplab-Large-FOV
View on GitHub
My Implementation of the deeplab_v1 (known as deeplab large fov)
☆29Mar 6, 2019Updated 7 years ago
leafy-lee / E-commercial-dataset
View on GitHub
the dataset of electronic commercial image used for saliency etc.
☆18Apr 7, 2024Updated 2 years ago
qinzzz / Multimodal-Alignment-Framework
View on GitHub
Implementation for MAF: Multimodal Alignment Framework
☆45Nov 25, 2020Updated 5 years ago
chengyzhao / TextPSG
View on GitHub
☆19Oct 22, 2023Updated 2 years ago
HKUST-LongGroup / CFA
View on GitHub
[ICCV 2023] Compositional Feature Augmentation for Unbiased Scene Graph Generation
☆15Dec 5, 2023Updated 2 years ago
Dawn-LX / OpenVoc-VidVRD
View on GitHub
Official code for the ICLR2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
☆43Jun 4, 2024Updated 2 years ago
qiuyue1993 / Notes
View on GitHub
Research Notes
☆11Sep 13, 2020Updated 5 years ago