Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention (CVPR 2023)
☆32Mar 28, 2023Updated 2 years ago
Alternatives and similar repositories for DependencyViT
Users that are interested in DependencyViT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Masking Strategies for Background Bias Removal in Computer Vision Models (ICCVW OODCV 2023 paper)☆16Jul 3, 2025Updated 8 months ago
- ☆16Apr 10, 2025Updated 11 months ago
- ☆44Dec 14, 2022Updated 3 years ago
- ☆11Dec 21, 2023Updated 2 years ago
- (IJCV 2023) Offical implementation of "SCT: A Simple Baseline for Parameter-Efficient Fine-Tuning via Salient Channels"☆13Mar 20, 2025Updated last year
- The human parsing network used in ViTAA, which is specially trained for reid datasets.☆17Oct 12, 2021Updated 4 years ago
- Code release for "Clue Me In: Semi-Supervised FGVC with Out-of-Distribution Data".☆13Apr 11, 2022Updated 3 years ago
- Convolutional Fine-Grained Classification with Self-Supervised Target Relation Regularization (TIP 2022)☆12Sep 8, 2022Updated 3 years ago
- For CVPR2022 Submission☆10Sep 4, 2022Updated 3 years ago
- Repository for conditional transport☆15Jan 12, 2022Updated 4 years ago
- ☆11Aug 31, 2023Updated 2 years ago
- ☆15Mar 9, 2023Updated 3 years ago
- Transformer-based Weakly-Supervised Change Detection. Code for paper: Exploring Effective Priors and Efficient Models for Weakly-Supervis…☆40Dec 15, 2025Updated 3 months ago
- Codebase for AAAI 2024 conference paper Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning☆39Mar 12, 2025Updated last year
- GMNet: Graded-Feature Multilabel-Learning Network for RGB-Thermal Urban Scene Semantic Segmentation☆21Dec 8, 2021Updated 4 years ago
- A PyTorch implementation of SIN.☆12Oct 20, 2021Updated 4 years ago
- The application demonstrates the Image Semantic Segmentation considering the input image in portrait mode and changes the background to g…☆18Sep 17, 2019Updated 6 years ago
- ☆36Jun 27, 2023Updated 2 years ago
- Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs☆34Sep 21, 2025Updated 6 months ago
- This repo contains the pytorch implementation for Dynamic Concept Learner (accepted by ICLR 2021).☆37Jul 8, 2024Updated last year
- Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO☆54Sep 3, 2020Updated 5 years ago
- SVGD implementation☆12Jul 23, 2018Updated 7 years ago
- High Resolution Class Activation Mapping for Discriminative Feature Localization☆13Jan 7, 2021Updated 5 years ago
- [NeurIPS 2025 Datasets & Benchmarks Track] The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models☆35Oct 26, 2025Updated 4 months ago
- ☆17Aug 14, 2024Updated last year
- Code release for Your “An Erudite Fine-Grained Visual Classification Model (CVPR 2023)"☆17Jun 2, 2023Updated 2 years ago
- MDRNet+:Mitigating Modality Discrepancies for RGB-T Semantic Segmentation (ABMDRNet extended version)☆22Feb 9, 2023Updated 3 years ago
- Pytorch implementation of F-CAM. Paper: "F-CAM: Full Resolution Class Activation Maps via Guided Parametric Upscaling".☆15Jan 21, 2023Updated 3 years ago
- This is the project page for our ECCV2020 paper: "Deep near-light photometric stereo for spatially varying reflectances".☆10Jul 29, 2024Updated last year
- Code for WACV 2021 Paper "Meta Module Network for Compositional Visual Reasoning"☆43May 13, 2021Updated 4 years ago
- ☆15Sep 21, 2022Updated 3 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning"☆18Mar 15, 2021Updated 5 years ago
- [NeurIPS 2022 Spotlight] Learning Equivariant Segmentation with Instance-Unique Querying☆22Dec 17, 2022Updated 3 years ago
- [CVPR 2025] Offical implementation of the paper "Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters The…☆31Mar 12, 2026Updated last week
- Code for the paper "Multi-Task Learning of Object States and State-Modifying Actions from Web Videos" published in TPAMI☆11Mar 3, 2024Updated 2 years ago
- Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" arXiv, 2023☆14Apr 1, 2024Updated last year
- distill large scale web page text☆12Jul 29, 2023Updated 2 years ago
- MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering…☆13Feb 18, 2023Updated 3 years ago
- ☆17Sep 2, 2023Updated 2 years ago