Kipok / understanding-momentum
Code for the paper "Understanding the Role of Momentum in Stochastic Gradient Methods"
☆15Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for understanding-momentum
- A simple middleware to improving GPU utilization then speedup online inference.☆19Updated 3 years ago
- Partially Adaptive Momentum Estimation method in the paper "Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep …☆39Updated last year
- Colab notebooks for d2l-book☆11Updated 4 years ago
- ☆23Updated 4 years ago
- ☆13Updated 6 years ago
- Implementation of the LOSSGRAD optimization algorithm☆15Updated 5 years ago
- Exploiting Uncertainty of Loss Landscape for Stochastic Optimization☆15Updated 5 years ago
- SSL using PyTorch☆49Updated 4 years ago
- Pytorch implementation of "Hallucinating Agnostic Images to Generalize Across Domains"☆11Updated 5 years ago
- Repository for the code of the paper "Neural Networks Regularization Through Class-wise Invariant Representation Learning".☆12Updated 7 years ago
- Various implementations and experimentation for deep neural network model compression☆24Updated 6 years ago
- ☆23Updated 5 years ago
- Implementation for <Regularizing Neural Networks via Minimizing Hyperspherical Energy> in CVPR'20.☆23Updated 4 years ago
- Implementation for <Neural Similarity Learning> in NeurIPS'19.☆33Updated 4 years ago
- a replicate of https://arxiv.org/pdf/1711.00937.pdf☆16Updated 6 years ago
- Implementation of Spectral Leakage and Rethinking the Kernel Size in CNNs in Pytorch☆14Updated 3 years ago
- Lifelong Variational Autoencoder☆14Updated 6 years ago
- Reversible Recurrent Neural Network Pytorch Implementation☆21Updated 6 years ago
- ☆34Updated 5 years ago
- Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…☆45Updated 5 years ago
- Official Pytorch Implementation for the paper 'SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients'☆17Updated 2 years ago
- ☆39Updated 5 years ago
- Easy Multiprocessing for Python☆43Updated 4 years ago
- ☆15Updated 3 months ago
- Official implementation for "Minimax Active Learning" in PyTorch.☆9Updated 3 years ago
- Meta-SGD Algorithms Implementation☆21Updated 3 months ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Updated 3 years ago