ariG23498 / TokenLearner
TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"
☆33Updated 3 years ago
Alternatives and similar repositories for TokenLearner:
Users who are interested in TokenLearner are comparing it to the libraries listed below
- This is an official PyTorch/GPU implementation of SupMAE.☆78Updated 2 years ago
- ☆52Updated 2 years ago
- [ICME 2022] Code for the paper "SimViT: Exploring a Simple Vision Transformer with Sliding Windows".☆68Updated 2 years ago
- Masked Vision-Language Transformer in Fashion☆33Updated last year
- Official codes for ConMIM (ICLR 2023)☆58Updated 2 years ago
- ☆34Updated last year
- [ECCV 2022] This repository includes the official implementation of our paper "In Defense of Image Pre-Training for Spatiotemporal Recogniti…☆19Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- ☆54Updated 2 years ago
- ☆58Updated 2 years ago
- Official pytorch implementation of the IrwGAN for unaligned image-to-image translation☆34Updated 3 years ago
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆55Updated 8 months ago
- HIRL: A General Framework for Hierarchical Image Representation Learning (http://arxiv.org/abs/2205.13159)☆40Updated 2 years ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆49Updated 2 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆55Updated last year
- GroupViT: Semantic Segmentation Emerges from Text Supervision☆25Updated 2 years ago
- [ACM MM 2022] Towards Counterfactual Image Manipulation via CLIP☆37Updated 2 years ago
- ☆16Updated last year
- FastMIM, official pytorch implementation of our paper "FastMIM: Expediting Masked Image Modeling Pre-training for Vision"(https://arxiv.o…☆39Updated 2 years ago
- i-mae Pytorch Repo☆20Updated last year
- ☆26Updated 3 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆63Updated last month
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year
- Official Code of ECCV 2022 paper MS-CLIP☆89Updated 2 years ago
- Code and Models for "GeneCIS: A Benchmark for General Conditional Image Similarity"☆58Updated last year
- Clipora is a powerful toolkit for fine-tuning OpenCLIP models using Low Rank Adapters (LoRA).☆21Updated 8 months ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆35Updated 2 years ago
- Official Pytorch codebase for Open-Vocabulary Instance Segmentation without Manual Mask Annotations [CVPR 2023]☆50Updated 4 months ago
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Updated 3 years ago
- ☆21Updated 2 years ago