Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
☆16Jul 1, 2025Updated 9 months ago
Alternatives and similar repositories for MaPeT
Users that are interested in MaPeT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …☆25Dec 4, 2025Updated 4 months ago
- MIMIC: Masked Image Modeling with Image Correspondences☆16Jun 14, 2024Updated last year
- Baseline Code for CVPR 2023 paper. "Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline".☆15Sep 21, 2023Updated 2 years ago
- [ECCV'24] Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities☆52Jul 2, 2025Updated 9 months ago
- Statistical processing of COVID-19 data using Apache Beam for Google Cloud Dataflow in Python. Project for the exam of "Sistemi ed Applic…☆11Apr 20, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆17Feb 20, 2025Updated last year
- ☆13Dec 12, 2022Updated 3 years ago
- [BMVC 2024 Oral ✨] Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization☆20Sep 11, 2024Updated last year
- [MICCAI 2023] Official implementation of our MICCAI 2023 paper "Pick the Best Pre-trained Model: Towards Transferability Estimation for M…☆13Jul 27, 2023Updated 2 years ago
- PyTorch code for BMVC 2019 paper: Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters☆20Jan 4, 2023Updated 3 years ago
- ☆18Jan 2, 2024Updated 2 years ago
- ☆20Dec 12, 2022Updated 3 years ago
- PyTorch code for the paper: "Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation"☆19Aug 5, 2021Updated 4 years ago
- [CVPRW'23 Best Paper Award] Zero-shot Unsupervised Transfer Instance Segmentation☆24Aug 22, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PDF 文件的加密与去密☆18May 15, 2023Updated 2 years ago
- ☆17Oct 21, 2019Updated 6 years ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- Remove glyps from TTF fonts☆13Dec 4, 2025Updated 4 months ago
- Pytorch implementation of DeepLabV1-LargeFOV, DeepLabV2-ResNet101, DeepLabV3, and DeepLabV3+. In progress...☆23Aug 18, 2021Updated 4 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆57May 10, 2025Updated 11 months ago
- Pytorch Code for "A Broad Study on the Transferability of Visual Representations with Contrastive Learning" (ICCV 2021)☆36Mar 14, 2022Updated 4 years ago
- ☆12Jul 19, 2023Updated 2 years ago
- CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022☆29Dec 1, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [CVPR'22] Semi-Supervised Video Semantic Segmentation with Inter-Frame Feature Reconstruction☆29Oct 17, 2022Updated 3 years ago
- [CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning☆40Aug 16, 2023Updated 2 years ago
- Pytorch implementation of TSE attention☆16Jul 9, 2021Updated 4 years ago
- [CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval☆36Sep 12, 2025Updated 7 months ago
- (ICME24) This is the offical repository of iDAT: inverse Distillation Adapter-Tuning.☆13Apr 3, 2024Updated 2 years ago
- 汇集各类字体,包括手写体,韩文等☆11Jan 24, 2024Updated 2 years ago
- A tool to quantify and report the carbon footprint of machine learning computations and communication☆22Sep 5, 2023Updated 2 years ago
- Description of 4 sketch style (4SKST) dataset☆13May 19, 2023Updated 2 years ago
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958)☆44Sep 5, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆10Apr 1, 2024Updated 2 years ago
- This is a repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆12Nov 21, 2022Updated 3 years ago
- This repository is an official PyTorch implementation of our paper "Feature Distillation Interaction Weighting Network for Lightweight Im…☆13May 6, 2023Updated 2 years ago
- ☆39Jul 20, 2022Updated 3 years ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆29Jan 23, 2024Updated 2 years ago
- AI-SAM: Automatic and Interactive Segment Anything Model☆21Feb 25, 2025Updated last year
- ☆12Oct 31, 2021Updated 4 years ago