izmailovpavel / torch_swa_examples
☆47Jan 11, 2021Updated 5 years ago
Alternatives and similar repositories for torch_swa_examples
Users that are interested in torch_swa_examples are comparing it to the libraries listed below
- This is an article about using variational autoencoders for the generation of new data. It contains the code for generating the plots and…☆12Feb 15, 2021Updated 5 years ago
- ☆62Apr 19, 2022Updated 3 years ago
- Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"☆33Oct 19, 2022Updated 3 years ago
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…☆20Sep 18, 2023Updated 2 years ago
- ☆10Mar 20, 2021Updated 4 years ago
- Incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- ☆10Jun 3, 2019Updated 6 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10May 16, 2018Updated 7 years ago
- Spell and pronounce words with a neural network☆10Feb 13, 2017Updated 9 years ago
- ☆24May 1, 2025Updated 9 months ago
- ☆252Dec 27, 2022Updated 3 years ago
- ☆12Sep 26, 2019Updated 6 years ago
- Official code for Coupled Oscillatory RNN (ICLR 2021, Oral)☆53Aug 26, 2021Updated 4 years ago
- Implementation for ACProp (Momentum centering and asynchronous update for adaptive gradient methods, NeurIPS 2021)☆16Oct 11, 2021Updated 4 years ago
- Official PyTorch implementation of the paper "ProbAct: A Probabilistic Activation Function for Deep Neural Networks"☆13Jun 10, 2019Updated 6 years ago
- Implementation of "Structured Multi-Hashing for Model Compression" (CVPR 2020)☆12Feb 18, 2021Updated 4 years ago
- Code accompanying VarGrad: A Low-Variance Gradient Estimator for Variational Inference☆12Oct 12, 2020Updated 5 years ago
- Very simple and short implementation of gradient boosting in 18 lines of code☆10Sep 17, 2020Updated 5 years ago
- A PyTorch implementation of "Meta-Amortized Variational Inference and Learning" (https://arxiv.org/abs/1902.01950)☆14Mar 31, 2020Updated 5 years ago
- Experiments for the NeurIPS 2021 paper "Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks"☆13Oct 25, 2021Updated 4 years ago
- Sequence data label generation and ingestion into deep learning models☆12Nov 17, 2021Updated 4 years ago
- Implements stochastic line search☆118Mar 14, 2023Updated 2 years ago
- Low-variance, efficient and unbiased gradient estimation for optimizing models with binary latent variables. (ICLR 2019)☆27Mar 9, 2019Updated 6 years ago
- Code for the paper "Bayesian Neural Network Priors Revisited"☆60Jul 1, 2021Updated 4 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Oct 30, 2020Updated 5 years ago
- Data-free knowledge distillation using Gaussian noise (NeurIPS paper)☆15Mar 24, 2023Updated 2 years ago
- This repository provides the source code used in the paper "A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth Trade-Off"☆13May 30, 2019Updated 6 years ago
- An empirical investigation of deep learning theory☆16Oct 3, 2019Updated 6 years ago
- Code for "Training Deep Energy-Based Models with f-Divergence Minimization" ICML 2020☆37Mar 24, 2023Updated 2 years ago
- ☆30Oct 26, 2020Updated 5 years ago
- Fork of diux-dev/imagenet18☆16Oct 4, 2018Updated 7 years ago
- [BMVC 2022] Information Theoretic Representation Distillation☆19Oct 6, 2023Updated 2 years ago
- Code for 'Periodic Activation Functions Induce Stationarity' (NeurIPS 2021)☆19Oct 27, 2021Updated 4 years ago
- Implementation of the paper "Efficient Attention Network: Accelerate Attention by Searching Where to Plug"☆20Jun 16, 2023Updated 2 years ago
- Self-Distillation with weighted ground-truth targets; ResNet and Kernel Ridge Regression☆19Oct 12, 2021Updated 4 years ago
- Tools for working with Long Short-Term Memory (LSTM) networks and sequences in Pytorch☆36Jan 29, 2021Updated 5 years ago
- Experiments from the paper "On Second Order Behaviour in Augmented Neural ODEs"☆61Sep 30, 2024Updated last year
- ☆35Aug 2, 2019Updated 6 years ago
- Public Codebase for Rethinking Parameter Counting: Effective Dimensionality Revisited☆37Dec 27, 2022Updated 3 years ago