JGuillaumin / swa-tf
Stochastic Weight Averaging - TensorFlow implementation
☆33Updated 5 years ago
Alternatives and similar repositories for swa-tf:
Users that are interested in swa-tf are comparing it to the libraries listed below
- Python way to Read/Write TFRecords☆64Updated 6 years ago
- Prunable nn layers for pytorch.☆48Updated 6 years ago
- Various implementations and experimentation for deep neural network model compression☆24Updated 6 years ago
- Switch Normalization implementation for Keras 2+☆30Updated 6 years ago
- An optimizer that trains as fast as Adam and as good as SGD in Tensorflow☆45Updated 5 years ago
- "Learning Rate Dropout" in PyTorch☆34Updated 5 years ago
- Training RNNs as Fast as CNNs (Simple Recurrent Unit)☆30Updated 7 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Updated last year
- tunz's CUDA pytorch operator (MaskedSoftmax)☆75Updated 5 years ago
- PyTorch Examples repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆62Updated 5 months ago
- Adaptive Stochastic Natural Gradient Method for One-Shot Neural Architecture Search☆88Updated 5 years ago
- Implementation of Octave Convolution from Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convol…☆56Updated 5 years ago
- TensorFlow implementation of PNASNet-5 on ImageNet☆101Updated 6 years ago
- Simple Tensorflow implementation of "On The Variance Of The Adaptive Learning Rate And Beyond"☆97Updated 4 years ago
- Code for reproducing results of the paper "Layer rotation: a surprisingly powerful indicator of generalization in deep networks?"☆50Updated 5 years ago
- mixup: Beyond Empirical Risk Minimization☆99Updated 7 years ago
- Adaptive embedding and softmax☆17Updated 2 years ago
- Filter Response Normalization tested on better ImageNet baselines.☆35Updated 4 years ago
- Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization at CVPR'19☆48Updated 5 years ago
- Exploiting Uncertainty of Loss Landscape for Stochastic Optimization☆15Updated 5 years ago
- Octave convolution☆34Updated 2 years ago
- AdaBatch: Adaptive Batch Sizes for Training Deep Neural Networks☆41Updated 7 years ago
- Cyclic learning rate TensorFlow implementation.☆66Updated 5 years ago
- ☆29Updated 6 years ago
- Simple Tensorflow implementation of "On the Convergence of Adam and Beyond" (ICLR 2018)☆104Updated 5 years ago
- Corrupted labels and label smoothing☆128Updated 7 years ago
- WassersteinGAN-TensorFlow☆17Updated 7 years ago