MarSaKi / NvEM
[ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"
☆79Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for NvEM
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆72Updated last year
- [ICCV 2023} Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"☆185Updated last year
- accepted by ICME2023 oral(CCF B)☆63Updated last year
- The implementaion of CoDT on the task of NTU-60+->PKUMMD☆74Updated last year
- Two paper About robot navigation in dynamic environment☆46Updated last year
- [ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation☆148Updated last month
- The official implement of DS2DP [TGRS 2022]☆60Updated 4 months ago
- Panoptic Scene Graph Biased Annotation☆31Updated 4 months ago
- [IJCAI-2022] Official Codes for "SimMC: Simple Masked Contrastive Learning of Skeleton Representations for Unsupervised Person Re-Identif…☆59Updated 3 months ago
- The code for ECCV2022 paper: Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval☆57Updated 2 years ago
- The code is for PBRnet for action detection☆75Updated 3 years ago
- ☆62Updated 11 months ago
- [IJCV-2023] Official Codes for "Hierarchical Skeleton Meta-Prototype Contrastive Learning with Hard Skeleton Mining for Unsupervised Pers…☆63Updated 3 months ago
- ☆57Updated last year
- "Towards Semi-supervised Learning with Non-random Missing Labels" by Yue Duan (ICCV 2023)☆77Updated 3 months ago
- [IJCAI-2021] Official Codes for "Multi-Level Graph Encoding with Structural-Collaborative Relation Learning for Skeleton-Based Person Re-…☆57Updated last year
- Official implementation of "Self-slimmed Vision Transformer" (ECCV2022)☆74Updated 2 years ago
- ☆56Updated last year
- ☆86Updated 4 months ago
- Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval --ICCV2023 Oral☆90Updated last year
- The Pytorch implementation of Grounding 3D Object Affordance from 2D Interactios in Images.☆113Updated last year
- ☆84Updated last year
- Official implementation of BMVC2023 Oral paper: 《Describe Your Facial Expressions by Linking Image Encoders and Large Language Models》☆59Updated 11 months ago
- [IJCAI-2020] Official Codes for "Self-Supervised Gait Encoding with Locality-Aware Attention for Person Re-Identification"☆67Updated last year
- [TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"☆217Updated 3 months ago
- Accepted by ICCV2023, Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-bas…☆103Updated 6 months ago
- [ACMMM-2021] Official Codes for "SM-SGE: A Self-Supervised Multi-Scale Skeleton Graph Encoding Framework for Person Re-Identification"☆57Updated 7 months ago
- Public code for VTSNN: A Virtual Temporal Spiking Neural Network (Fron. Neur.)☆55Updated last year
- Offical implementation of "WHEN SPIKING NEURAL NETWORKS MEET TEMPORAL ATTENTION IMAGE DECODING AND ADAPTIVE SPIKING NEURON" (ICLR2023)☆63Updated last year
- The official implementation of BadHash☆56Updated 2 years ago