[CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.
☆33Jul 12, 2023Updated 2 years ago
Alternatives and similar repositories for SK-VG
Users that are interested in SK-VG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for paper "VD-PCR: Improving Visual Dialog with Pronoun Coreference Resolution"☆10Nov 1, 2022Updated 3 years ago
- This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with H…☆16May 21, 2024Updated last year
- A curated list of research papers in Referring Expression Comprehension (REC)☆46May 13, 2021Updated 4 years ago
- Official codebase for "Ref-NMS: Breaking Proposal Bottlenecks in Two-Stage Referring Expression Grounding"☆22Dec 20, 2020Updated 5 years ago
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020☆90Sep 30, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆67May 26, 2022Updated 3 years ago
- ☆16Dec 28, 2020Updated 5 years ago
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Feb 26, 2025Updated last year
- ☆31Nov 17, 2024Updated last year
- [TIP2020] ICNet: Information Conversion Network for RGB-D Based Salient Object Detection☆17Nov 17, 2023Updated 2 years ago
- ☆22Mar 7, 2025Updated last year
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …☆506Aug 9, 2024Updated last year
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- ☆39Jun 28, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆34Feb 22, 2026Updated last month
- ☆34Mar 10, 2023Updated 3 years ago
- ☆16Jun 11, 2021Updated 4 years ago
- ☆14Jul 13, 2021Updated 4 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Mar 23, 2026Updated 3 weeks ago
- ☆91Apr 15, 2022Updated 4 years ago
- Code accompanying the paper Optimizing the F-measure for Threshold-free Salient Object Detection.☆29Aug 13, 2019Updated 6 years ago
- Learning Situation Hyper-Graphs for Video Question Answering☆23Feb 16, 2024Updated 2 years ago
- ☆13Oct 30, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Open-Vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models (ICCV 20…☆18Apr 23, 2024Updated last year
- Uncertainty-Aware Curriculum Learning for Neural Machine Translation (ACL 2020)☆11Jun 12, 2020Updated 5 years ago
- Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020☆12Aug 28, 2020Updated 5 years ago
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding☆153Jul 13, 2024Updated last year
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆97Dec 2, 2022Updated 3 years ago
- Codes for our ACM MM 2019 paper: "Exploiting Temporal Relationships in Video Moment Localization with Natural Language"☆16Oct 22, 2022Updated 3 years ago
- ☆28Jul 1, 2020Updated 5 years ago
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- Official code for the ICLR2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection☆43Jun 4, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation for MAF: Multimodal Alignment Framework☆46Nov 25, 2020Updated 5 years ago
- Research Notes☆11Sep 13, 2020Updated 5 years ago
- ☆19Oct 22, 2023Updated 2 years ago
- ☆19Jun 10, 2025Updated 10 months ago
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- [ICCV 2023] Compositional Feature Augmentation for Unbiased Scene Graph Generation☆15Dec 5, 2023Updated 2 years ago
- My Implementation of the deeplab_v1 (known as deeplab large fov)☆29Mar 6, 2019Updated 7 years ago