(CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling
☆211Jul 28, 2024Updated last year
Alternatives and similar repositories for iTPN
Users that are interested in iTPN are comparing it to the libraries listed below
Sorting:
- ☆75Mar 1, 2023Updated 3 years ago
- [ICCV 2025] EA-ViT: Efficient Adaptation for Elastic Vision Transformer☆26Jul 28, 2025Updated 7 months ago
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking☆11Sep 3, 2024Updated last year
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆19Oct 7, 2024Updated last year
- [AAAI2025] SUTrack: Towards Simple and Unified Single Object Tracking☆128Jun 16, 2025Updated 8 months ago
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Apr 12, 2022Updated 3 years ago
- ☆97Dec 17, 2024Updated last year
- [NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos☆27Apr 8, 2025Updated 11 months ago
- VMamba: Visual State Space Models,code is based on mamba☆3,054Mar 7, 2025Updated last year
- ☆149Jun 25, 2024Updated last year
- [AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues☆61May 2, 2025Updated 10 months ago
- (SaGe) Semantic-Aware Generation for Self-Supervised Visual Representation Learning☆26Mar 29, 2022Updated 3 years ago
- ☆24Apr 3, 2024Updated last year
- vHeat: Building Vision Models upon Heat Conduction☆272Jun 12, 2025Updated 8 months ago
- ☆21May 7, 2024Updated last year
- BS2T: Bottleneck Spatial–Spectral Transformer for Hyperspectral Image Classification.☆18Feb 17, 2023Updated 3 years ago
- ☆31Sep 24, 2024Updated last year
- CVPR24☆64Aug 4, 2024Updated last year
- PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411☆113Jun 9, 2023Updated 2 years ago
- [IEEE TII 2025] Official Implementation for "Dual-Detector Reoptimization for Federated Weakly Supervised Video Anomaly Detection via Ada…☆26Nov 11, 2025Updated 3 months ago
- Official repository of "TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection"☆11May 25, 2025Updated 9 months ago
- [IEEE TCSVT 2025] Event stream based visual object tracking using Mamba/State Space Model☆45Jul 18, 2025Updated 7 months ago
- SeqTrackv2: Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking☆86Mar 26, 2024Updated last year
- ☆10May 24, 2022Updated 3 years ago
- The official code for the paper Evolved Part Masking for Self-Supervised Learning.☆16Jun 14, 2023Updated 2 years ago
- Robust Tracking via Mamba-based Context-aware Token Learning (AAAI 2025)☆16Nov 6, 2025Updated 4 months ago
- spatio-temporal tasks☆16Jul 15, 2024Updated last year
- ☆88Aug 31, 2023Updated 2 years ago
- [NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking☆73Sep 30, 2025Updated 5 months ago
- [CVPR 2023 Highlight] Masked Image Modeling with Local Multi-Scale Reconstruction☆55Jul 10, 2023Updated 2 years ago
- A toolbox for object skeleton detection, can also be used for edge detection, building extraction and road extraction. TIP (2021)☆140Feb 9, 2023Updated 3 years ago
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection☆73Oct 15, 2024Updated last year
- published in IEEE Transactions on Image Processing (TIP), 2023☆27Mar 4, 2023Updated 3 years ago
- Can we make visual tracking systems align more closely with human visual perception?☆22Mar 1, 2026Updated last week
- [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining☆107Apr 16, 2025Updated 10 months ago
- [ECCV 2024] Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance☆100Feb 6, 2026Updated last month
- Official Code of Paper "Reversible Column Networks" "RevColv2"☆265Sep 6, 2023Updated 2 years ago
- [CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"☆574May 22, 2023Updated 2 years ago
- ☆27Aug 8, 2022Updated 3 years ago