☆83Jun 30, 2022Updated 3 years ago
Alternatives and similar repositories for Zero-Shot-Detection-via-Vision-and-Language-Knowledge-Distillation
Users that are interested in Zero-Shot-Detection-via-Vision-and-Language-Knowledge-Distillation are comparing it to the libraries listed below
Sorting:
- ☆188Nov 7, 2022Updated 3 years ago
- Refer-Youtube-VOS dataset☆27Mar 10, 2026Updated last week
- 🚴♂️ ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection (MM 2020)☆35Jul 2, 2025Updated 8 months ago
- ☆11May 18, 2022Updated 3 years ago
- Codes and scripts for "Explainable Semantic Space by Grounding Languageto Vision with Cross-Modal Contrastive Learning"☆20Mar 23, 2022Updated 4 years ago
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement☆17Jan 5, 2026Updated 2 months ago
- This is the pytorch implementation for the paper "Delta-encoder: an effective sample synthesis method for few-shot object recognition" ht…☆11Jan 10, 2020Updated 6 years ago
- The Pytorch code of "Asymmetric Distribution Measure for Few-shot Learning", IJCAI 2020.☆15Oct 9, 2020Updated 5 years ago
- MSRSegNet: Multi-Scale Residual Network for Semantic Segmentation☆10Aug 9, 2018Updated 7 years ago
- [NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary …☆297Oct 12, 2022Updated 3 years ago
- Caffe re-implementation of dynamic network surgery.☆18Jun 15, 2018Updated 7 years ago
- PIC Challenge Baseline☆18Dec 27, 2018Updated 7 years ago
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024☆11Nov 7, 2024Updated last year
- LSTC: Boosting Atomic Action Detection with Long-Short-Term Context☆10Sep 1, 2022Updated 3 years ago
- ☆10Jan 3, 2023Updated 3 years ago
- The Station, an open-world multi-agent environment that models a miniature scientific ecosystem.☆111Feb 9, 2026Updated last month
- We release the DaTaSeg Objects365 Instance Segmentation Dataset introduced in the DaTaSeg paper, which can be used as an evaluation bench…☆21Dec 9, 2023Updated 2 years ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting☆545Sep 15, 2023Updated 2 years ago
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding [AAAI 23].☆16Apr 13, 2023Updated 2 years ago
- ☆12Aug 5, 2022Updated 3 years ago
- pytorch code for recurring paper:Zero-Shot Detection【https://arxiv.org/abs/1803.07113】☆19Jan 16, 2019Updated 7 years ago
- A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning☆26Jan 20, 2022Updated 4 years ago
- ☆164Apr 6, 2023Updated 2 years ago
- [ECCV 2020] Official Matlab implementation of rOSD: Toward unsupervised, multi-object discovery in large-scale image collections.☆10Nov 4, 2021Updated 4 years ago
- ☆129Jan 20, 2022Updated 4 years ago
- VQA baseline with Conditional Batch Normalization☆15Apr 9, 2018Updated 7 years ago
- Grounded Language-Image Pre-training☆2,580Jan 24, 2024Updated 2 years ago
- Zero Shot Detection Dataset☆25Jul 22, 2019Updated 6 years ago
- This is a modified version of Ankush's code for generating synthetic text images which support right-to-left languages such as Persian a…☆20Jul 3, 2019Updated 6 years ago
- Free-form Description-guided 3D Visual Graph Networks for Object Grounding in Point Cloud☆17Jun 23, 2022Updated 3 years ago
- Implementation of 3D attention mechanisms based on https://github.com/LeftAttention/Attention-Codebase. Thanks to LeftAttetnion for shari…☆12Feb 22, 2022Updated 4 years ago
- Pytorch 1.0 codes(including cuda codes) for Deformable Convolution Version 2☆18Mar 2, 2019Updated 7 years ago
- Pytorch implementation of CVPR'16 paper "Learning Deep Representations of Fine-Grained Visual Descriptions", by Reed et al.☆18Aug 16, 2020Updated 5 years ago
- Dataset and baseline for Scenario Oriented Object Navigation (SOON)☆23Nov 23, 2021Updated 4 years ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆52Oct 26, 2025Updated 4 months ago
- Code and Data for paper: Knowledge Perceived Multi-modal Pretraining in E-commerce (ACM MM2021)☆28Oct 10, 2022Updated 3 years ago
- Official PyTorch implementation of "TDAM: Top-down attention module for CNNs"☆13Oct 29, 2022Updated 3 years ago
- A new framework for open-vocabulary object detection, based on maskrcnn-benchmark☆248Feb 11, 2023Updated 3 years ago
- ICME 2017 "Learning a Multi-Center Convolutional Network for Unconstrained Face Alignment"☆16Mar 28, 2020Updated 5 years ago