sunsmarterjie/iTPN

(CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling

☆215

Alternatives and similar repositories for iTPN

Users who are interested in iTPN are comparing it to the libraries listed below.

zhangxiaosong18 / hivit
☆76 · Mar 1, 2023 · Updated 3 years ago
sunsmarterjie / ChatterBox
[AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues
☆61 · May 2, 2025 · Updated last year
sunsmarterjie / beyond_masking
Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers
☆26 · Apr 12, 2022 · Updated 4 years ago
sunsmarterjie / SaGe
(SaGe) Semantic-Aware Generation for Self-Supervised Visual Representation Learning
☆26 · Mar 29, 2022 · Updated 4 years ago
kangben258 / MCITrack
☆110 · Dec 17, 2024 · Updated last year
OrigamiSL / OTETrack
Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking
☆11 · Sep 3, 2024 · Updated last year
qiujihao19 / Artemis
[NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos
☆27 · Apr 8, 2025 · Updated last year
mingrui-wu / OSI-Bench
Official repo of From Indoor to Open World: Revealing the Spatial Reasoning Gap in MLLMs
☆24 · Jun 23, 2026 · Updated 3 weeks ago
AkitsukiM / VMamba-DOTA
☆31 · Sep 24, 2024 · Updated last year
sunsmarterjie / SDL-Skeleton
A toolbox for object skeleton detection, can also be used for edge detection, building extraction and road extraction. TIP (2021)
☆138 · Feb 9, 2023 · Updated 3 years ago
XiaokunFeng / MemVLT
[NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts
☆19 · Oct 7, 2024 · Updated last year
MzeroMiko / VMamba
VMamba: Visual State Space Models, code is based on mamba
☆3,206 · Mar 7, 2025 · Updated last year
hhb072 / STViT
☆152 · Jun 25, 2024 · Updated 2 years ago
zcxcf / EA-ViT
[ICCV 2025] EA-ViT: Efficient Adaptation for Elastic Vision Transformer
☆27 · Jul 28, 2025 · Updated 11 months ago
sunsmarterjie / DAAS
'Discretization-Aware Architecture Search' alleviates the discretization gap in one-shot differentiable NAS. DAAS has been accepted by PR…
☆20 · Jul 30, 2021 · Updated 4 years ago
MzeroMiko / vHeat
vHeat: Building Vision Models upon Heat Conduction
☆282 · Jun 12, 2025 · Updated last year
chenxin-dlut / SUTrack
[AAAI2025] SUTrack: Towards Simple and Unified Single Object Tracking
☆157 · Jun 16, 2025 · Updated last year
Yang-Bob / DSN
☆11 · May 24, 2022 · Updated 4 years ago
ucas-vg / GroupSampling
Published in IEEE Transactions on Image Processing (TIP), 2023
☆27 · Mar 4, 2023 · Updated 3 years ago
GXNU-ZhongLab / AQATrack
CVPR24
☆71 · Aug 4, 2024 · Updated last year
MzeroMiko / XDLM
[ICML 2026 Spotlight] Code for miXed Discrete Diffusion Language Model
☆27 · Mar 16, 2026 · Updated 4 months ago
HengLan / VastTrack
[NeurIPS 2024] VastTrack: Vast Category Visual Object Tracking
☆76 · Sep 30, 2025 · Updated 9 months ago
Event-AHU / MambaEVT
[IEEE TCSVT 2025] Event stream based visual object tracking using Mamba/State Space Model
☆51 · Jul 18, 2025 · Updated last year
srxlnnu / BS2T
BS2T: Bottleneck Spatial–Spectral Transformer for Hyperspectral Image Classification.
☆19 · Feb 17, 2023 · Updated 3 years ago
ZJier / CTMixer
ESI Highly Cited Papers (2025)
☆23 · Sep 20, 2022 · Updated 3 years ago
WenRuiCai / SPMTrack
Official implementation of "SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Trackin…
☆55 · Oct 19, 2025 · Updated 9 months ago
facebookresearch / r-mae
PyTorch implementation of R-MAE https://arxiv.org/abs/2306.05411
☆112 · Jun 9, 2023 · Updated 3 years ago
pengzhiliang / G2SD
☆85 · Aug 31, 2023 · Updated 2 years ago
OpenSpaceAI / UVLTrack
The official PyTorch implementation of our AAAI 2024 paper "Unifying Visual and Vision-Language Tracking via Contrastive Learning"
☆50 · Nov 4, 2024 · Updated last year
laisimiao / ReFocus_TIR_Tracking
[TNNLS 2024] Refocus the Attention for Parameter-Efficient Thermal Infrared Object Tracking
☆10 · Jun 20, 2025 · Updated last year
GXNU-ZhongLab / DUTrack
The official implementation for the CVPR'2025 paper Dynamic Updates for Language Adaptation in Visual-Language Tracking
☆43 · Mar 27, 2025 · Updated last year
chenxin-dlut / SeqTrackv2
SeqTrackv2: Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking
☆94 · Mar 26, 2024 · Updated 2 years ago
GXNU-ZhongLab / TemTrack
Robust Tracking via Mamba-based Context-aware Token Learning (AAAI 2025)
☆16 · Nov 6, 2025 · Updated 8 months ago
LiewFeng / imTED
[ICCV 2023] Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual Object Detection
☆73 · Oct 15, 2024 · Updated last year
lizhou-cs / JointNLT
The official implementation for the CVPR 2023 paper Joint Visual Grounding and Tracking with Natural Language Specification.
☆78 · Jun 3, 2023 · Updated 3 years ago
rayleizhu / BiFormer
[CVPR 2023] Official code release of our paper "BiFormer: Vision Transformer with Bi-Level Routing Attention"
☆581 · May 22, 2023 · Updated 3 years ago
GXNU-ZhongLab / EVPTrack
☆29 · Apr 3, 2024 · Updated 2 years ago
ucas-vg / P2BNet
ECCV2022, Point-to-Box Network for Accurate Object Detection via Single Point Supervision
☆65 · Jul 20, 2023 · Updated 3 years ago
OpenGVLab / PIIP
[NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)
☆113 · Aug 5, 2025 · Updated 11 months ago