david-wb / softargmaxLinks
A differentiable argmax function for PyTorch
☆43Updated 5 years ago
Alternatives and similar repositories for softargmax
Users that are interested in softargmax are comparing it to the libraries listed below
Sorting:
- Official PyTorch code for the paper "Improving Fractal Pre-training"☆27Updated 2 years ago
- Pre-training without Natural Images (ACCV 2020 Best Paper Honorable Mention Award)☆213Updated 3 years ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 3 years ago
- Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, de…☆102Updated 3 years ago
- Simple transformer implementations that I can understand☆20Updated 3 years ago
- t-vMF Similarity for Regularizing Intra-Class Feature Distribution☆21Updated 4 years ago
- (ICML 2022) Official PyTorch implementation of “Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Rob…☆78Updated 3 years ago
- PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)☆144Updated 3 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆116Updated 2 years ago
- ☆25Updated 2 years ago
- Paper Reading List I have already read☆30Updated last year
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆138Updated 5 months ago
- Replacing Labeled Real-Image Datasets with Auto-Generated Contours (CVPR 2022)☆43Updated 2 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities☆208Updated last year
- [NeurIPS 2021] Official codes for "Efficient Training of Visual Transformers with Small Datasets".☆144Updated 8 months ago
- A collection of differentiable SVD methods and ICCV21 "Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance P…☆78Updated last year
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆134Updated 4 years ago
- ☆68Updated 3 years ago
- The PyTorch implementation of Latent Video Transformer.☆99Updated last year
- Package for working with hypernetworks in PyTorch.☆129Updated last year
- PyTorch implementation of D2C: Diffuison-Decoding Models for Few-shot Conditional Generation.☆125Updated 3 years ago
- Implementation of Fast Transformer in Pytorch☆175Updated 4 years ago
- [NeurIPS'21] "Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly", Tianlong Chen, Yu Cheng, Zhe …☆85Updated 3 years ago
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding☆56Updated 11 months ago
- ☆66Updated 2 years ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆94Updated 4 years ago
- PyTorch implementation of "PatchVAE: Learning Local Latent Codes for Recognition" to appear in CVPR 2020☆14Updated 5 years ago
- This repository contains the code for our paper "Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguo…☆41Updated 2 years ago
- Implementation for our WACV 2021 paper "Multi-Loss Weighting with Coefficient of Variations"☆50Updated 4 years ago