ChristophReich1996 / SmeLU
PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Reproducibility and Smooth Activations" [arXiv 2022].
☆22Updated 2 years ago
Alternatives and similar repositories for SmeLU:
Users that are interested in SmeLU are comparing it to the libraries listed below
- ☆73Updated 2 years ago
- The official repository for "Intermediate Layers Matter in Momentum Contrastive Self Supervised Learning" paper.☆40Updated 2 years ago
- ☆11Updated 2 years ago
- Efficient Deep Learning Survey Paper☆33Updated 2 years ago
- ☆22Updated 2 years ago
- Reproducible code for Augmentation paper☆17Updated 6 years ago
- Active and Sample-Efficient Model Evaluation☆24Updated 4 years ago
- Improving Transformation Invariance in Contrastive Representation Learning☆13Updated 4 years ago
- PyTorch implementation of FNet: Mixing Tokens with Fourier transforms☆25Updated 3 years ago
- Recycling diverse models☆44Updated 2 years ago
- PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)☆30Updated 3 years ago
- Testing various improvements to Ranger21 for 2022☆18Updated 4 months ago
- An education step by step implementation of SimCLR that accompanies the blogpost☆32Updated 2 years ago
- A regularized self-labeling approach to improve the generalization and robustness of fine-tuned models☆28Updated 2 years ago
- Code for the ICLR 2022 paper "Attention-based interpretability with Concept Transformers"☆40Updated last year
- A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in y…☆45Updated 2 years ago
- Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.☆56Updated 2 years ago
- ☆49Updated 2 years ago
- Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms"☆15Updated 3 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆15Updated 3 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated 11 months ago
- ☆18Updated 10 months ago
- Finetune Google's pre-trained ViT models from HuggingFace's model hub.☆18Updated 3 years ago
- ☆35Updated last year
- ModelSoups for Tensorflow2 and Torch☆48Updated 2 years ago
- Explanation Optimization☆13Updated 4 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆29Updated last month
- ☆33Updated 2 years ago
- Official code for the paper: "Metadata Archaeology"☆19Updated last year
- Github code for the paper Maximum Class Separation as Inductive Bias in One Matrix. Arxiv link: https://arxiv.org/abs/2206.08704☆29Updated last year