shivram1987 / ActivationFunctionsLinks
☆53Updated 3 years ago
Alternatives and similar repositories for ActivationFunctions
Users that are interested in ActivationFunctions are comparing it to the libraries listed below
Sorting:
- Efficient Deep Learning Survey Paper☆33Updated 2 years ago
- PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Repr…☆22Updated 3 years ago
- Library - Vanilla, ViT, DeiT, BERT, GPT☆67Updated 4 years ago
- Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?☆17Updated 4 years ago
- Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.☆57Updated 3 years ago
- Implementation of transformers based architecture in PyTorch.☆54Updated 4 years ago
- Code repository of the paper "Modelling Long Range Dependencies in ND: From Task-Specific to a General Purpose CNN" https://arxiv.org/abs…☆183Updated 5 months ago
- Adaptive, interpretable wavelets across domains (NeurIPS 2021)☆81Updated 3 years ago
- Benchmarking PyTorch 2.0 different models☆20Updated 2 years ago
- PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)☆31Updated 4 years ago
- PyTorch implementation of FNet: Mixing Tokens with Fourier transforms☆28Updated 4 years ago
- A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.☆131Updated 10 months ago
- Code for the paper titled "Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks" (NeurIPS…☆11Updated 3 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆16Updated 4 years ago
- ☆60Updated 5 years ago
- Implementing RepVGG in PyTorch☆20Updated 3 years ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys"☆28Updated 3 years ago
- Layerwise Batch Entropy Regularization☆23Updated 3 years ago
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022].☆17Updated 3 years ago
- Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning☆165Updated last year
- Adversarial examples to the new ConvNeXt architecture☆20Updated 3 years ago
- Official repository for the paper "Zero-Shot AutoML with Pretrained Models"☆47Updated last year
- TedNet: A Pytorch Toolkit for Tensor Decomposition Networks☆97Updated 3 years ago
- Easy-to-use AdaHessian optimizer (PyTorch)☆79Updated 4 years ago
- Lipschitz Recurrent Neural Networks☆30Updated 4 years ago
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆13Updated 3 years ago
- Code for the anonymous submission "Cockpit: A Practical Debugging Tool for Training Deep Neural Networks"☆31Updated 4 years ago
- A simple implementation of a deep linear Pytorch module☆21Updated 4 years ago
- FLOPs calculator with tf.profiler for neural network architecture written in tensorflow 2.2+ (tf.keras)☆55Updated last year
- Collect optimizer related papers, data, repositories☆97Updated 10 months ago