ChristophReich1996 / SmeLULinks
PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Reproducibility and Smooth Activations" [arXiv 2022].
☆22Updated 3 years ago
Alternatives and similar repositories for SmeLU
Users that are interested in SmeLU are comparing it to the libraries listed below
Sorting:
- ☆111Updated 3 years ago
- ☆75Updated 3 years ago
- Recycling diverse models☆46Updated 3 years ago
- A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in y…☆44Updated 3 years ago
- The official repository for "Intermediate Layers Matter in Momentum Contrastive Self Supervised Learning" paper.☆40Updated 3 years ago
- ☆37Updated 3 years ago
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)☆63Updated 3 years ago
- Layerwise Batch Entropy Regularization☆24Updated 3 years ago
- Reproducible code for Augmentation paper☆17Updated 7 years ago
- A regularized self-labeling approach to improve the generalization and robustness of fine-tuned models☆27Updated 3 years ago
- ☆57Updated 4 years ago
- Code and results accompanying our paper titled RLSbench: Domain Adaptation under Relaxed Label Shift☆35Updated 2 years ago
- ModelSoups for Tensorflow2 and Torch☆50Updated 3 years ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf)☆82Updated 2 years ago
- Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms"☆15Updated 4 years ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆32Updated 2 years ago
- A project to add scalable state-of-the-art out-of-distribution detection (open set recognition) support by changing two lines of code! Pe…☆79Updated 3 years ago
- ☆38Updated 2 years ago
- ☆15Updated 3 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Updated 3 years ago
- Cyclemoid implementation for PyTorch☆90Updated 3 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆58Updated 2 years ago
- Efficient Deep Learning Survey Paper☆34Updated 2 years ago
- A simple Jax implementation of influence functions.☆20Updated last year
- Github code for the paper Maximum Class Separation as Inductive Bias in One Matrix. Arxiv link: https://arxiv.org/abs/2206.08704☆30Updated 2 years ago
- ☆12Updated 3 years ago
- ☆58Updated 2 years ago
- ☆23Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆81Updated 2 years ago