Natyren / MixedAELinks
This is open-source implementation of MixedAE (https://arxiv.org/pdf/2303.17152.pdf)
☆22Updated 11 months ago
Alternatives and similar repositories for MixedAE
Users that are interested in MixedAE are comparing it to the libraries listed below
Sorting:
- A Contrastive Learning Boost from Intermediate Pre-Trained Representations☆43Updated last year
- Framework for processing and filtering datasets☆31Updated last year
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆58Updated 2 years ago
- ☆22Updated 2 years ago
- ScrollNet for Continual Learning☆11Updated 2 years ago
- A simple Lightning Memory-Mapped Database (LMDB) converter for ImageFolder datasets in PyTorch. Using LMDB over a regular file structure …☆20Updated 4 years ago
- EasyPortrait - Face Parsing and Portrait Segmentation Dataset☆28Updated last year
- [NeurIPS 2024] Official implementation of the paper "Interfacing Foundation Models' Embeddings"☆129Updated last year
- Official Implementation for "HyperDomainNet: Universal Domain Adaptation for Generative Adversarial Networks" (NeurIPS 2022)☆92Updated 2 years ago
- Code and models for the paper "The effectiveness of MAE pre-pretraining for billion-scale pretraining" https://arxiv.org/abs/2303.13496☆92Updated this week
- Code release for "Improved baselines for vision-language pre-training"☆62Updated last year
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆28Updated last year
- ☆56Updated 2 years ago
- Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)☆11Updated last year
- PyTorch Implementation of Object Recognition as Next Token Prediction [CVPR'24 Highlight]☆182Updated 9 months ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆102Updated last year
- Pytorch based library to rank predicted bounding boxes using text/image user's prompts.☆52Updated 4 years ago
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation☆128Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Updated last year
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea…☆98Updated last year
- Video-LlaVA fine-tune for CinePile evaluation☆51Updated last year
- ☆191Updated last year
- ☆37Updated 2 years ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆60Updated 2 years ago
- Some augmentations that I hasn't found in other repositories and libraries.☆26Updated 2 years ago
- Data-Efficient Multimodal Fusion on a Single GPU☆68Updated last year
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…☆226Updated last year
- Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders☆131Updated 10 months ago
- OmniFusion — a multimodal model to communicate using text and images☆234Updated last year
- 4th place solution for the Google Universal Image Embedding Kaggle Challenge. Instance-Level Recognition workshop at ECCV 2022☆43Updated 2 years ago