bonlime / BAdam
Adam with minor modifications which give significant improvement
☆19 · Updated 4 years ago
Alternatives and similar repositories for BAdam
Users who are interested in BAdam are comparing it to the libraries listed below.
- (unofficial) - customized fork of DETR, optimized for intelligent object detection on 'real world' custom datasets ☆12 · Updated 5 years ago
- State-of-the-art data augmentation search algorithms in PyTorch ☆47 · Updated 2 years ago
- Code for training on Imagenet to SOTA results using PyTorch ☆13 · Updated 2 years ago
- Implementation of Spectral Leakage and Rethinking the Kernel Size in CNNs in Pytorch ☆14 · Updated 4 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch ☆36 · Updated 4 years ago
- Unofficial PyTorch implementation of "Filter Response Normalization Layer: Eliminating Batch Dependence in the Training of Deep Neural Ne… ☆22 · Updated 6 years ago
- Implementation of Kronecker Attention in Pytorch ☆19 · Updated 5 years ago
- ☆55 · Updated 4 years ago
- ☆41 · Updated 4 years ago
- An open source implementation of CLIP. ☆33 · Updated 3 years ago
- ☆15 · Updated 3 years ago
- Lightweight knowledge distillation pipeline ☆28 · Updated 4 years ago
- Experiments with the ideas presented in https://arxiv.org/abs/2003.00152 by Frankle et al. ☆29 · Updated 5 years ago
- A PyTorch Dataset that caches samples in shared memory, accessible globally to all processes ☆23 · Updated 3 years ago
- Includes additional materials for the following keras.io blog post. ☆12 · Updated 4 years ago
- PyTorch implementation of Contrastive Feature Loss for Image Prediction (AIM Workshop at ICCV 2021) ☆55 · Updated 4 years ago
- Unofficial pytorch implementation of ReZero in ResNet ☆24 · Updated 5 years ago
- diffGrad: An Optimization Method for Convolutional Neural Networks ☆55 · Updated 3 years ago
- Advanced optimizer with Gradient-Centralization ☆21 · Updated 5 years ago
- Large dataset storage format for Pytorch ☆45 · Updated 4 years ago
- GAN models implemented with Pytorch Lightning and Hydra configuration ☆33 · Updated 3 years ago
- Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2. ☆86 · Updated 4 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012 ☆49 · Updated 3 years ago
- Pytorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition?" ☆99 · Updated 5 years ago
- Image data augmentation scheduler for albumentations transforms ☆19 · Updated 4 years ago
- Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration" ☆15 · Updated 4 years ago
- ☆26 · Updated 5 years ago
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022]. ☆18 · Updated 3 years ago
- QUick and DIrty Domain Adaptation ☆23 · Updated 2 years ago
- ☆12 · Updated 3 years ago