☆16Dec 28, 2020Updated 5 years ago
Alternatives and similar repositories for KB-Ref_dataset
Users that are interested in KB-Ref_dataset are comparing it to the libraries listed below
Sorting:
- A curated list of research papers in Referring Expression Comprehension (REC)☆46May 13, 2021Updated 4 years ago
- Contrastive Learning for Image Captioning☆51Feb 22, 2018Updated 8 years ago
- video captioning☆24Mar 14, 2019Updated 6 years ago
- BottomUpTopDown VQA model with question-type debiasing☆22Oct 6, 2019Updated 6 years ago
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.☆33Jul 12, 2023Updated 2 years ago
- A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…☆29Jan 15, 2022Updated 4 years ago
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)☆31Jun 28, 2024Updated last year
- Referring Expression Parser☆27Feb 10, 2018Updated 8 years ago
- SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)☆33Sep 29, 2021Updated 4 years ago
- ☆18Jun 10, 2025Updated 8 months ago
- ☆30Dec 16, 2022Updated 3 years ago
- Transformation Driven Visual Reasoning - CVPR 2021☆36May 27, 2023Updated 2 years ago
- Torch Implementation of Speaker-Listener-Reinforcer for Referring Expression Generation and Comprehension☆34Mar 8, 2018Updated 7 years ago
- Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)☆33May 12, 2022Updated 3 years ago
- ☆39Jun 28, 2023Updated 2 years ago
- 收集了一些经典的神经网络论文☆12Aug 11, 2024Updated last year
- Code Repository for Research Article Titled - "Omnidirectional Video Super-Resolution using Deep Learning"☆14Apr 16, 2023Updated 2 years ago
- 我的中文在线简历。My Resume in Zh-CN.☆12Dec 19, 2023Updated 2 years ago
- RS Generate dataset☆16Jan 2, 2025Updated last year
- ☆10Jun 14, 2024Updated last year
- ☆14Aug 28, 2024Updated last year
- [CVPR2024] Dataset and Code of "CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement".☆14Dec 14, 2024Updated last year
- Counterfactual Samples Synthesizing for Robust VQA☆79Nov 24, 2022Updated 3 years ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆47Oct 14, 2024Updated last year
- ☆11Aug 12, 2024Updated last year
- Task Aware Downscaling for efficient storing and accurate reconstruction in image and video domain☆12Jul 25, 2024Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 10 months ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- The code for AIM2022 compressed image super-resolution☆11Nov 30, 2022Updated 3 years ago
- WBSR: Rethinking Imbalance in Image Super-Resolution for Efficient Inference☆13Oct 8, 2024Updated last year
- Conditional Latent Coding (CLC) for Deep Image Compression☆15Feb 6, 2026Updated 3 weeks ago
- Beyond Degradation Redundancy: Contrastive Prompt Learning for All-in-One Image Restoration☆23Feb 23, 2026Updated last week
- ☆10Mar 21, 2022Updated 3 years ago
- Facebook SAM3例程☆44Jan 23, 2026Updated last month
- quagga☆10Apr 7, 2020Updated 5 years ago
- ☆16Oct 9, 2024Updated last year
- Code for MME-SID accepted to CIKM 2025 Full Research track.☆27Oct 29, 2025Updated 4 months ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Aug 1, 2025Updated 7 months ago