BobMcDear / simsiam-pytorch
PyTorch implementation of SimSiam
☆8Updated last year
Related projects
Alternatives and complementary repositories for simsiam-pytorch
- ☆9Updated 10 months ago
- [ICML2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, Ying Ding, and Zhangyang Wang☆26Updated 2 years ago
- ☆42Updated 6 years ago
- PyTorch reimplementation of the Smooth ReLU activation function proposed in the paper "Real World Large Scale Recommendation Systems Repr…☆21Updated 2 years ago
- ☆24Updated 3 years ago
- Experiments from the paper "Recurrent Highway Networks with Grouped Auxiliary Memory"☆20Updated 4 years ago
- Example code for the NNGeometry PyTorch library☆10Updated 2 years ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆26Updated 3 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆47Updated last year
- Code for testing DCT plus Sparse (DCTpS) networks☆14Updated 3 years ago
- Layerwise Batch Entropy Regularization☆22Updated 2 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆14Updated 4 years ago
- PyTorch implementation of the vision transformer☆19Updated last year
- Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023.☆23Updated last year
- Implementation of the article **Deep Learning CUDA Memory Usage and Pytorch optimization tricks**☆42Updated 4 years ago
- ☆21Updated last month
- Usable implementation of Mogrifier, a circuit for enhancing LSTMs and potentially other networks, from Deepmind☆16Updated 5 months ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Updated last year
- Blog post☆16Updated 9 months ago
- ☆17Updated 4 years ago
- Code for the DDP tutorial☆32Updated 2 years ago
- ☆45Updated 4 months ago
- Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization☆16Updated 6 years ago
- Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)☆17Updated 3 years ago
- A study on the following problems: what the memorization problem is in meta-learning; why memorization problem happens; and how we can pr…☆20Updated last year
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS '22)☆16Updated 2 years ago
- ☆15Updated last year
- ☆24Updated 4 years ago
- PyTorch implementation of "A Theoretically Grounded Application of Dropout in Recurrent Neural Networks" LSTM (https://arxiv.org/abs/1512.…☆19Updated 4 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆15Updated 3 years ago