wangpengnorman/KB-Ref_dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wangpengnorman/KB-Ref_dataset)

wangpengnorman / KB-Ref_dataset

☆16

Alternatives and similar repositories for KB-Ref_dataset

Users that are interested in KB-Ref_dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

doubledaibo / clcaption_nips2017
View on GitHub
Contrastive Learning for Image Captioning
☆51Feb 22, 2018Updated 8 years ago
chrisc36 / bottom-up-attention-vqa
View on GitHub
BottomUpTopDown VQA model with question-type debiasing
☆22Oct 6, 2019Updated 6 years ago
zyang-ur / ReSC
View on GitHub
Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020
☆90Sep 30, 2021Updated 4 years ago
daqingliu / awesome-rec
View on GitHub
A curated list of research papers in Referring Expression Comprehension (REC)
☆46May 13, 2021Updated 5 years ago
google-research-datasets / Video-Timeline-Tags-ViTT
View on GitHub
A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…
☆30Jan 15, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lichengunc / refer-parser2
View on GitHub
Referring Expression Parser
☆27Feb 10, 2018Updated 8 years ago
PAL-ML / PEARL_v1
View on GitHub
☆30Jan 17, 2022Updated 4 years ago
daicoolb / Awesome-Video-Captioning
View on GitHub
video captioning
☆24Mar 14, 2019Updated 7 years ago
ByZ0e / Glance-Focus
View on GitHub
This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)
☆31Jun 28, 2024Updated 2 years ago
visinf / lnfmm
View on GitHub
Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)
☆33May 12, 2022Updated 4 years ago
zhjohnchan / SK-VG
View on GitHub
[CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.
☆34Jul 12, 2023Updated 3 years ago
ZhecanJamesWang / GLAT_SGG
View on GitHub
Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"
☆11Dec 16, 2020Updated 5 years ago
Subangkar / N-Puzzle-Problem-CPP-Implementation-using-A-Star-Search
View on GitHub
A C++ implementation of N Puzzle problem using A Star Search with heuristics of Manhattan Distance, Hamming Distance & Linear Conflicts
☆10Dec 3, 2018Updated 7 years ago
zyang-ur / SAT
View on GitHub
SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)
☆32Sep 29, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yanxinzju / CSS-VQA
View on GitHub
Counterfactual Samples Synthesizing for Robust VQA
☆78Nov 24, 2022Updated 3 years ago
bengsfort / 8-puzzle-solutions
View on GitHub
Solutions to the classic 8 puzzle, implementing an A* best search algorithm to complete the puzzle in multiple languages.
☆16Nov 1, 2018Updated 7 years ago
jlian2 / mucko
View on GitHub
Pytorch Implementation of MUCKO(2020 IJCAI)
☆20Oct 25, 2020Updated 5 years ago
jialinwu17 / MAVEX
View on GitHub
☆30Dec 16, 2022Updated 3 years ago
thecharm / MNRE
View on GitHub
Resource and Code for ICME 2021 paper "MNRE: A Challenge Multimodal Dataset for Neural Relation Extraction with Visual Evidence in Social…
☆74Nov 23, 2021Updated 4 years ago
yrcong / NODIS
View on GitHub
Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020
☆12Aug 28, 2020Updated 5 years ago
XL2248 / SOV-MAS
View on GitHub
The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"
☆11May 16, 2023Updated 3 years ago
Buzz-Beater / EgoTaskQA
View on GitHub
Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.
☆45Apr 17, 2023Updated 3 years ago
lichengunc / MAttNet
View on GitHub
MAttNet: Modular Attention Network for Referring Expression Comprehension
☆299Nov 29, 2022Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
WangFei-2019 / SNARE
View on GitHub
Project for SNARE benchmark
☆11Jun 5, 2024Updated 2 years ago
Peratham / video2text.pytorch
View on GitHub
PyTorch implementation of video captioning
☆13Sep 24, 2017Updated 8 years ago
hy0523 / MTNet
View on GitHub
Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation[TNNLS2024]
☆14May 6, 2025Updated last year
Adam1679 / mutan-article-net
View on GitHub
Implementation of Mutan+ArticleNet on OKVQA
☆10Jan 11, 2021Updated 5 years ago
zjuruizhechen / TVG-R1
View on GitHub
[EMNLP 2025 Industry] Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning
☆36Oct 22, 2025Updated 9 months ago
AmingWu / CCN
View on GitHub
Connective Cognition Network for Directional Visual Commonsense Reasoning
☆15May 6, 2021Updated 5 years ago
sairin1202 / Commonsense-Knowledge-Aware-Concept-Selection-For-Diverse-and-Informative-Visual-Storytelling
View on GitHub
The implement of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling
☆12Aug 19, 2021Updated 4 years ago
gchhablani / multilingual-image-captioning
View on GitHub
☆43Aug 2, 2021Updated 4 years ago
ZiyueHuang / MXSeq2Seq
View on GitHub
seq2seq with attention in mxnet
☆18Oct 13, 2017Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hassanhub / MultiGrounding
View on GitHub
This is the repo for Multi-level textual grounding
☆34Jul 21, 2020Updated 6 years ago
hughplay / TVR
View on GitHub
Transformation Driven Visual Reasoning - CVPR 2021
☆36May 27, 2023Updated 3 years ago
knightyxp / DGL
View on GitHub
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
☆49Oct 14, 2024Updated last year
jiangzhubo / What-is-Fine-tuning
View on GitHub
PPT for Fine tuning
☆11Apr 22, 2018Updated 8 years ago
allenschmaltz / word_ordering
View on GitHub
This repository includes code for replicating the results in the paper "Word Ordering Without Syntax" (2016).
☆21Dec 8, 2016Updated 9 years ago
chihyaoma / cyclical-visual-captioning
View on GitHub
PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision
☆46Jul 29, 2020Updated 5 years ago
iacercalixto / butd-image-captioning
View on GitHub
Bottom-up Top-down image captioning model with PyTorch.
☆14Dec 5, 2020Updated 5 years ago