(CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling
☆214Jul 28, 2024Updated last year
Alternatives and similar repositories for iTPN
Users that are interested in iTPN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues☆61May 2, 2025Updated last year
- ☆110Dec 17, 2024Updated last year
- (SaGe) Semantic-Aware Generation for Self-Supervised Visual Representation Learning☆26Mar 29, 2022Updated 4 years ago
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking☆11Sep 3, 2024Updated last year
- [ICCV 2025] EA-ViT: Efficient Adaptation for Elastic Vision Transformer☆27Jul 28, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos☆27Apr 8, 2025Updated last year
- [AAAI2025] SUTrack: Towards Simple and Unified Single Object Tracking☆151Jun 16, 2025Updated last year
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆19Oct 7, 2024Updated last year
- ☆31Sep 24, 2024Updated last year
- A toolbox for object skeleton detection, can also be used for edge detection, building extraction and road extraction. TIP (2021)☆139Feb 9, 2023Updated 3 years ago
- VMamba: Visual State Space Models,code is based on mamba☆3,186Mar 7, 2025Updated last year
- ☆152Jun 25, 2024Updated 2 years ago
- ☆27Apr 3, 2024Updated 2 years ago
- vHeat: Building Vision Models upon Heat Conduction☆281Jun 12, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 'Discretization-Aware Architecture Search' alleviates the discretization gap in one-shot differentiable NAS. DAAS has been accepted by PR…☆20Jul 30, 2021Updated 4 years ago
- ☆11May 24, 2022Updated 4 years ago
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆53Oct 19, 2025Updated 8 months ago
- published in IEEE Transactions on Image Processing (TIP), 2023☆27Mar 4, 2023Updated 3 years ago
- CVPR24☆69Aug 4, 2024Updated last year
- [IEEE TCSVT 2025] Event stream based visual object tracking using Mamba/State Space Model☆51Jul 18, 2025Updated 11 months ago
- [NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking☆76Sep 30, 2025Updated 8 months ago
- Robust Tracking via Mamba-based Context-aware Token Learning (AAAI 2025)☆16Nov 6, 2025Updated 7 months ago
- BS2T: Bottleneck Spatial–Spectral Transformer for Hyperspectral Image Classification.☆19Feb 17, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ESI Highly Cited Papers (2025)☆23Sep 20, 2022Updated 3 years ago
- PyTorch implementation of R-MAE https//arxiv.org/abs/2306.05411☆112Jun 9, 2023Updated 3 years ago
- [ECCV 2024] Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance☆120Feb 6, 2026Updated 4 months ago
- [CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding☆49Feb 28, 2026Updated 4 months ago
- The official implementation for the CVPR'2025 paper Dynamic Updates for Language Adaptation in Visual-Language Tracking☆41Mar 27, 2025Updated last year
- [TNNLS 2024] Refocus the Attention for Parameter-Efficient Thermal Infrared Object Tracking☆10Jun 20, 2025Updated last year
- The official pytorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"☆50Nov 4, 2024Updated last year
- ☆85Aug 31, 2023Updated 2 years ago
- SeqTrackv2: Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking☆94Mar 26, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection☆73Oct 15, 2024Updated last year
- The official implementation for the CVPR 2023 paper Joint Visual Grounding and Tracking with Natural Language Specification.☆77Jun 3, 2023Updated 3 years ago
- [CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"☆582May 22, 2023Updated 3 years ago
- ☆21May 7, 2024Updated 2 years ago
- [NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)☆113Aug 5, 2025Updated 10 months ago
- ECCV2022, Point-to-Box Network for Accurate Object Detection via Single Point Supervision☆66Jul 20, 2023Updated 2 years ago
- official repo for `thinking with images through-self-calling`☆25Dec 28, 2025Updated 6 months ago