google-research / diffstride
TF/Keras code for DiffStride, a pooling layer with learnable strides.
☆125Updated 3 years ago
Alternatives and similar repositories for diffstride:
Users that are interested in diffstride are comparing it to the libraries listed below
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆115Updated 2 years ago
- ☆164Updated 2 years ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities☆209Updated 10 months ago
- Implementation of ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks, ICML 2021.☆141Updated 3 years ago
- (ICML 2022) Official PyTorch implementation of “Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Rob…☆78Updated 2 years ago
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆225Updated 2 years ago
- Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/21…☆119Updated 2 years ago
- Train ImageNet *fast* in 500 lines of code with FFCV☆140Updated 10 months ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆86Updated 3 years ago
- An unopinionated replacement for PyTorch's Dataset and ImageFolder, that handles Tar archives☆76Updated 2 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆211Updated 2 years ago
- PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)☆140Updated 2 years ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆87Updated last year
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆80Updated last year
- Wavelet Flow: Fast Training of High Resolution Normalizing Flows☆59Updated 3 years ago
- Transformers w/o Attention, based fully on MLPs☆93Updated 11 months ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆102Updated 2 years ago
- Tensorflow 2 implementations of the C-SimCLR and C-BYOL self-supervised visual representation methods from "Compressive Visual Representa…☆37Updated 3 years ago
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆129Updated last year
- Whitening for Self-Supervised Representation Learning | Official repository☆129Updated 2 years ago
- Code for paper "Orthogonal Convolutional Neural Networks".☆116Updated 3 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆204Updated last year
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆50Updated 2 years ago
- FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning.☆206Updated last year
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆118Updated 3 years ago
- Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in P…☆201Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆99Updated 2 years ago
- Official PyTorch implementation for FastDPM, a fast sampling algorithm for diffusion probabilistic models☆82Updated 3 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch☆251Updated 2 years ago