google-research / diffstrideLinks
TF/Keras code for DiffStride, a pooling layer with learnable strides.
☆124Updated 3 years ago
Alternatives and similar repositories for diffstride
Users that are interested in diffstride are comparing it to the libraries listed below
Sorting:
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆116Updated 2 years ago
- ☆164Updated 2 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆217Updated 2 years ago
- Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/21…☆123Updated 2 years ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities☆208Updated last year
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆228Updated 3 years ago
- Tensorflow 2 implementations of the C-SimCLR and C-BYOL self-supervised visual representation methods from "Compressive Visual Representa…☆37Updated 3 years ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆95Updated 3 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆206Updated 2 years ago
- An unopinionated replacement for PyTorch's Dataset and ImageFolder, that handles Tar archives☆77Updated 3 years ago
- PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)☆144Updated 3 years ago
- Implementation of RQ Transformer, proposed in the paper "Autoregressive Image Generation using Residual Quantization"☆122Updated 3 years ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Updated 2 years ago
- ☆69Updated 4 years ago
- A Domain-Agnostic Benchmark for Self-Supervised Learning☆106Updated 2 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- Implementation of LogAvgExp for Pytorch☆37Updated 6 months ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆120Updated 4 years ago
- (ICML 2022) Official PyTorch implementation of “Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Rob…☆78Updated 3 years ago
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆141Updated 7 months ago
- Examples for the WebDataset PyTorch Dataset Library☆51Updated 4 years ago
- Adaptive Gradient Clipping☆151Updated 3 years ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper☆105Updated 3 years ago
- Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch☆42Updated 2 years ago
- A better PyTorch implementation of image local attention which reduces the GPU memory by an order of magnitude.☆140Updated 3 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆102Updated 3 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆101Updated 2 years ago
- ☆75Updated 2 years ago
- Code to accompany the paper "Hierarchical Quantized Autoencoders"☆37Updated 2 years ago
- Layerwise Batch Entropy Regularization☆23Updated 3 years ago