karlhigley / torch-optim-sparseLinks
PyTorch optimizers with sparse momentum and weight decay
☆10Updated 4 years ago
Alternatives and similar repositories for torch-optim-sparse
Users that are interested in torch-optim-sparse are comparing it to the libraries listed below
Sorting:
- Code for MSID, a Multi-Scale Intrinsic Distance for comparing generative models, studying neural networks, and more!☆51Updated 6 years ago
- Code for "Aggregated Momentum: Stability Through Passive Damping", Lucas et al. 2018☆34Updated 6 years ago
- Probabilistic classification in PyTorch/TensorFlow/scikit-learn with Fenchel-Young losses☆186Updated last year
- Exponential Machines implementation☆42Updated 5 months ago
- ☆45Updated 5 years ago
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.☆47Updated 5 years ago
- Дипломная работа бакалавра / Bachelor thesis☆11Updated 9 years ago
- PyTorch Flexible Hash Embeddings☆28Updated 5 years ago
- Public repository for the work on bandit problems☆23Updated last year
- Pytorch implementation of Variational Dropout Sparsifies Deep Neural Networks☆83Updated 3 years ago
- Implementations of quasi-hyperbolic optimization algorithms.☆102Updated 5 years ago
- Various experiments on the [Fashion-MNIST](https://github.com/zalandoresearch/fashion-mnist) dataset from Zalando☆31Updated 7 years ago
- A lightweight library for tensorflow 2.0☆66Updated 5 years ago
- PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting.☆37Updated 7 years ago
- ☆50Updated 7 years ago
- Python implementation of GLN in different frameworks☆97Updated 4 years ago
- Blazingly fast capsule networks in 75 lines of pytorch+einops☆26Updated 3 years ago
- ☆16Updated 8 years ago
- ☆26Updated 6 years ago
- TBA☆76Updated 6 years ago
- ☆80Updated 7 years ago
- Generative Latent Attentive Sampler☆26Updated 8 years ago
- Graph-based learning in Python☆17Updated 7 years ago
- Simple ranking metrics for PyTorch on CPU or GPU☆15Updated 4 years ago
- Implementation of linear CorEx and temporal CorEx.☆37Updated 3 years ago
- Implementation of Counterfactual risk minimization☆26Updated 8 years ago
- An implementation of shampoo☆77Updated 7 years ago
- Kervolution implementation using TF2.0☆20Updated 2 years ago
- ☆71Updated 4 years ago
- Graduate topics course on learning discrete latent structure.☆67Updated 6 years ago