(CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling
☆212Jul 28, 2024Updated last year
Alternatives and similar repositories for iTPN
Users that are interested in iTPN are comparing it to the libraries listed below.
- ☆76Mar 1, 2023Updated 3 years ago
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Apr 12, 2022Updated 4 years ago
- ☆103Dec 17, 2024Updated last year
- (SaGe) Semantic-Aware Generation for Self-Supervised Visual Representation Learning☆26Mar 29, 2022Updated 4 years ago
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking☆11Sep 3, 2024Updated last year
- [ICCV 2025] EA-ViT: Efficient Adaptation for Elastic Vision Transformer☆27Jul 28, 2025Updated 9 months ago
- [NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos☆27Apr 8, 2025Updated last year
- [AAAI2025] SUTrack: Towards Simple and Unified Single Object Tracking☆144Jun 16, 2025Updated 11 months ago
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆19Oct 7, 2024Updated last year
- ☆31Sep 24, 2024Updated last year
- A toolbox for object skeleton detection, can also be used for edge detection, building extraction and road extraction. TIP (2021)☆139Feb 9, 2023Updated 3 years ago
- VMamba: Visual State Space Models, code is based on mamba☆3,150Mar 7, 2025Updated last year
- ☆150Jun 25, 2024Updated last year
- ☆25Apr 3, 2024Updated 2 years ago
- vHeat: Building Vision Models upon Heat Conduction☆280Jun 12, 2025Updated 11 months ago
- 'Discretization-Aware Architecture Search' alleviates the discretization gap in one-shot differentiable NAS. DAAS has been accepted by PR…☆20Jul 30, 2021Updated 4 years ago
- ☆10May 24, 2022Updated 3 years ago
- Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…☆53Oct 19, 2025Updated 7 months ago
- published in IEEE Transactions on Image Processing (TIP), 2023☆27Mar 4, 2023Updated 3 years ago
- CVPR24☆67Aug 4, 2024Updated last year
- [IEEE TCSVT 2025] Event stream based visual object tracking using Mamba/State Space Model☆50Jul 18, 2025Updated 10 months ago
- [NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking☆75Sep 30, 2025Updated 7 months ago
- Robust Tracking via Mamba-based Context-aware Token Learning (AAAI 2025)☆16Nov 6, 2025Updated 6 months ago
- BS2T: Bottleneck Spatial–Spectral Transformer for Hyperspectral Image Classification.☆19Feb 17, 2023Updated 3 years ago
- ESI Highly Cited Papers (2025)☆23Sep 20, 2022Updated 3 years ago
- PyTorch implementation of R-MAE https://arxiv.org/abs/2306.05411 ☆111Jun 9, 2023Updated 2 years ago
- [ECCV 2024] Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance☆114Feb 6, 2026Updated 3 months ago
- [TNNLS 2024] Refocus the Attention for Parameter-Efficient Thermal Infrared Object Tracking☆10Jun 20, 2025Updated 11 months ago
- The official implementation for the CVPR'2025 paper Dynamic Updates for Language Adaptation in Visual-Language Tracking☆39Mar 27, 2025Updated last year
- ☆86Aug 31, 2023Updated 2 years ago
- SeqTrackv2: Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking☆91Mar 26, 2024Updated 2 years ago
- [ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection☆73Oct 15, 2024Updated last year
- The official implementation for the CVPR 2023 paper Joint Visual Grounding and Tracking with Natural Language Specification.☆77Jun 3, 2023Updated 2 years ago
- [CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"☆580May 22, 2023Updated 2 years ago
- ☆21May 7, 2024Updated 2 years ago
- [NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)☆113Aug 5, 2025Updated 9 months ago
- ECCV2022, Point-to-Box Network for Accurate Object Detection via Single Point Supervision☆67Jul 20, 2023Updated 2 years ago
- official repo for `thinking with images through-self-calling`☆26Dec 28, 2025Updated 4 months ago
- [TMM-2023] Official implementation of "Towards Complete and Detail-Preserved Salient Object Detection", A.K.A [Arxiv] SelfReformer☆72Nov 16, 2024Updated last year