facebookresearch / dmae_st
Directed masked autoencoders
☆14Updated 2 years ago
Alternatives and similar repositories for dmae_st:
Users who are interested in dmae_st are comparing it to the repositories listed below
- Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration"☆15Updated 3 years ago
- Implementation for NATv2.☆23Updated 4 years ago
- A PyTorch Dataset that caches samples in shared memory, accessible globally to all processes☆21Updated 2 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval”☆11Updated 9 months ago
- We investigated corruption robustness across different architectures including Convolutional Neural Networks, Vision Transformers, and th…☆15Updated 3 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆35Updated 3 years ago
- Local Attention - Flax module for Jax☆20Updated 3 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated last year
- Various test models in WNNX format. They can be viewed with `pip install wnetron && wnetron`☆12Updated 2 years ago
- Implementation of Metaformer, but in an autoregressive manner☆23Updated 2 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated last year
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- ☆15Updated last year
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated 10 months ago
- Implementation of Kronecker Attention in Pytorch☆18Updated 4 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆36Updated last year
- Bag of MLP☆20Updated 3 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated last year
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆10Updated 4 years ago
- ☆12Updated 3 years ago
- ☆17Updated last year
- Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Updated 3 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated 3 weeks ago
- Self-Distillation with weighted ground-truth targets; ResNet and Kernel Ridge Regression☆17Updated 3 years ago
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.☆23Updated 3 years ago
- ☆24Updated 3 years ago
- Pytorch implementation of StyleGAN2 in my style☆11Updated last year
- Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry"☆13Updated 11 months ago
- Official code for the paper: "Metadata Archaeology"☆19Updated last year