pseeth / autoclipLinks
Adaptive Gradient Clipping
☆135Updated 2 years ago
Alternatives and similar repositories for autoclip
Users that are interested in autoclip are comparing it to the libraries listed below
Sorting:
- ☆163Updated 2 years ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 3 years ago
- Relative Positional Encoding for Transformers with Linear Complexity☆64Updated 3 years ago
- A PyTorch Implementation of the Sparsemax operator (https://arxiv.org/pdf/1803.09820.pdf)☆34Updated 2 years ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms