(CVPR2023/TPAMI2024) Integrally Pre-Trained Transformer Pyramid Networks -- A Hierarchical Vision Transformer for Masked Image Modeling
☆214Jul 28, 2024Updated last year
Alternatives and similar repositories for iTPN
Users that are interested in iTPN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆76Mar 1, 2023Updated 3 years ago
- [AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues☆61May 2, 2025Updated last year
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Apr 12, 2022Updated 4 years ago
- ☆105Dec 17, 2024Updated last year
- (SaGe) Semantic-Aware Generation for Self-Supervised Visual Representation Learning☆26Mar 29, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking☆11Sep 3, 2024Updated last year
- [ICCV 2025] EA-ViT: Efficient Adaptation for Elastic Vision Transformer☆27Jul 28, 2025Updated 10 months ago
- [NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos☆27Apr 8, 2025Updated last year
- [AAAI2025] SUTrack: Towards Simple and Unified Single Object Tracking☆147Jun 16, 2025Updated 11 months ago
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆18Oct 7, 2024Updated last year
- ☆31Sep 24, 2024Updated last year
- A toolbox for object skeleton detection, can also be used for edge detection, building extraction and road extraction. TIP (2021)☆139Feb 9, 2023Updated 3 years ago
- VMamba: Visual State Space Models,code is based on mamba☆3,172Mar 7, 2025Updated last year