lessw2020 / Ranger22
Testing various improvements to Ranger21 for 2022
☆18Updated 2 years ago
Related projects: ⓘ
- ☆41Updated 3 years ago
- ☆36Updated this week
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆50Updated 2 years ago
- Simple MAE (masked autoencoders) with pytorch and pytorch-lightning.☆39Updated 7 months ago
- Implementation of LogAvgExp for Pytorch☆32Updated 2 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆115Updated last year
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆74Updated 2 years ago
- Another attempt at a long-context / efficient transformer by me☆37Updated 2 years ago
- A collection of optimizers, some arcane others well known, for Flax.☆29Updated 3 years ago
- Implementation of Uformer, Attention-based Unet, in Pytorch☆92Updated 2 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆19Updated last year
- Pytorch cyclic cosine decay learning rate scheduler☆46Updated 3 years ago
- A convolution-free, transformer-only version of the CycleGAN framework☆32Updated 2 years ago
- Framework for creating (partially) reversible neural networks with PyTorch☆144Updated 2 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Updated 2 years ago
- Implementations of Recent Papers in Computer Vision☆39Updated 2 years ago
- ☆25Updated 2 years ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆76Updated 8 months ago
- PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes☆42Updated 4 months ago
- An open source implementation of CLIP.☆32Updated last year
- This is a offical PyTorch/GPU implementation of SupMAE.☆76Updated 2 years ago
- PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Repr…☆21Updated 2 years ago
- ☆6Updated 9 months ago
- diffGrad: An Optimization Method for Convolutional Neural Networks☆54Updated last year
- Code of "Deep invariant networks with differentiable augmentation layers"☆18Updated last year
- The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha…☆58Updated 3 years ago
- ☆36Updated 3 months ago
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆120Updated 8 months ago
- Simple but high-performing method for learning a policy of test-time augmentation☆38Updated last year
- ☆73Updated last year