pseeth / autoclip
Adaptive Gradient Clipping
☆117Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for autoclip
- ☆164Updated last year
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 2 years ago
- Implementation of Nyström Self-attention, from the paper Nyströmformer☆122Updated 10 months ago
- Implementation of Feedback Transformer in Pytorch☆104Updated 3 years ago
- ☆22Updated last month
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆115Updated last year
- Code repository of the paper "Wavelet Networks: Scale-Translation Equivariant Learning From Raw Time-Series, TMLR" https://arxiv.org/abs…☆80Updated 10 months ago
- Online Normalization for Training Neural Networks (Companion Repository)☆79Updated 3 years ago
- Code to accompany the paper "Hierarchical Quantized Autoencoders"☆37Updated last year
- Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/21…☆117Updated last year
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities☆208Updated 6 months ago
- Jax/Flax implementation of Variational-DiffWave.☆40Updated 2 years ago
- Implementation of Flow++ in PyTorch☆40Updated 5 years ago
- Implementation of the Adan (ADAptive Nesterov momentum algorithm) Optimizer in Pytorch☆247Updated 2 years ago
- Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning☆155Updated 9 months ago
- Relative Positional Encoding for Transformers with Linear Complexity☆61Updated 2 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆207Updated last year
- Inspired by "Neural Networks Fail to Learn Periodic Functions and How to Fix It"☆58Updated 6 months ago
- Implementation of Linformer for Pytorch☆257Updated 10 months ago
- Collection of PyTorch Lightning implementations of Generative Adversarial Network varieties presented in research papers.☆167Updated 2 years ago
- Traditional Machine Learning Models for Large-Scale Datasets in PyTorch.☆126Updated 3 weeks ago
- PyTorch implementation of Sinusodial Representation networks (SIREN)☆263Updated last year
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆77Updated 9 months ago
- Pytorch Implementation of OpenAI's "Improved Variational Inference with Inverse Autoregressive Flow"☆80Updated 4 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆100Updated 3 years ago
- Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms☆251Updated 3 years ago
- Sequence Modeling with Structured State Spaces☆60Updated 2 years ago
- Unofficial PyTorch implementation of Google's FNet: Mixing Tokens with Fourier Transforms. With checkpoints.☆67Updated 2 years ago
- PyTorch reimplementation of per-channel energy normalization for audio.☆94Updated 5 years ago