[AAAI-2024] Structural Information Guided Multimodal Pre-training for Vehicle-centric Perception, Xiao Wang, Wentao Wu, Chenglong Li, Zhicheng Zhao, Zhe Chen, Yukai Shi, Jin Tang
☆27Jul 29, 2024Updated last year
Alternatives and similar repositories for VehicleMAE
Users that are interested in VehicleMAE are comparing it to the libraries listed below
Sorting:
- ☆10Dec 16, 2023Updated 2 years ago
- This repository summarizes the human-centered applications of event data☆13Jan 31, 2025Updated last year
- Official repository of the UPAR dataset for pedestrian attribute recognition and attribute-based person retrieval☆14Jan 22, 2024Updated 2 years ago
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024.☆13Feb 28, 2024Updated 2 years ago
- Papers of "A Survey on Multimodal LLMs from the Perspective of Input-Output Space Extension"☆17Feb 4, 2026Updated 3 weeks ago
- ☆10Feb 15, 2025Updated last year
- Official PyTorch implementation of paper: "Diffusion-based Synthetic Data Generation for Visible-Infrared Person Re-Identification".☆17May 27, 2025Updated 9 months ago
- [NeurIPS2024] PLIP: Language-Image Pre-training for Person Representation Learning☆132Dec 17, 2024Updated last year
- Event based Sign-Language-Translation☆19Updated this week
- ☆15Jun 17, 2025Updated 8 months ago
- Official pytorch implementation of the ICML2024 main conference paper: Pedestrian Attribute Recognition as Label-balanced Multi-label Lea…☆13Jul 22, 2024Updated last year
- [ICASSP'25] Enhancing Vision-Language Tracking by Effectively Converting Textual Cues into Visual Cues☆17Dec 31, 2024Updated last year
- ☆14Sep 12, 2020Updated 5 years ago
- ☆15Dec 3, 2021Updated 4 years ago
- ☆24Apr 3, 2024Updated last year
- ☆17Mar 30, 2024Updated last year
- WebUAV-3M: A million-scale multi-modal UAV tracking benchmark☆66Sep 5, 2025Updated 5 months ago
- Repo of NeurIPS23☆18Oct 25, 2023Updated 2 years ago
- [CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity☆79Sep 28, 2024Updated last year
- [IEEE TMM 2025] CRSOT: Cross-Resolution Object Tracking using Unaligned Frame and Event Cameras☆21Jan 18, 2025Updated last year
- ☆20Oct 30, 2024Updated last year
- The official pytorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"☆45Nov 4, 2024Updated last year
- The official implementation for the paper [Towards Unified Token Learning for Vision-Language Tracking].☆23Dec 13, 2023Updated 2 years ago
- Hwangsae is stork, not crane.☆18Mar 7, 2022Updated 3 years ago
- Video Diffusion State Space Models☆19Mar 27, 2024Updated last year
- [ICCV 2023] Robust Object Modeling for Visual Tracking, Official Implementation☆47Jan 5, 2025Updated last year
- [NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception☆44Mar 25, 2024Updated last year
- Improving Mamaba performance on Video Understanding task☆45Dec 30, 2025Updated 2 months ago
- Bi-directional Adapter for Multi-modal Tracking☆96Mar 19, 2024Updated last year
- The implement of "Learning Spatial-Frequency Transformer for Visual Object Tracking"☆20Jun 29, 2023Updated 2 years ago
- An official implementation of "Video-based Person Re-identification with Spatial and Temporal Memory Networks" (ICCV 2021) in PyTorch.☆51Nov 1, 2021Updated 4 years ago
- Object Detection for Video with MXNet and GluonCV using YOLOv3☆22Nov 21, 2022Updated 3 years ago
- ☆27Jun 4, 2024Updated last year
- Event-based Person ReId☆25Sep 9, 2024Updated last year
- [NeurIPS 2023 Spotlight] ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking☆19Mar 4, 2024Updated last year
- [CVPR 2024] SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking☆57Jun 30, 2024Updated last year
- ☆25Dec 23, 2024Updated last year
- Rotation Equivariant Siamese Networks for Tracking☆27Jun 18, 2021Updated 4 years ago
- ☆54Apr 13, 2023Updated 2 years ago