ShigekiKarita / pytorch-distributed-slurm-example
☆42Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for pytorch-distributed-slurm-example
- ☆61Updated 4 years ago
- Distributed, mixed-precision training with PyTorch☆89Updated 4 years ago
- An implementation of shampoo☆74Updated 6 years ago
- "Layer-wise Adaptive Rate Scaling" in PyTorch☆86Updated 3 years ago
- Xuhong Li, Yves Grandvalet, and Franck Davoine. "Explicit Inductive Bias for Transfer Learning with Convolutional Networks." In ICML 2018…☆55Updated 6 years ago
- A plug-in replacement for DataLoader to load Imagenet disk-sequentially in PyTorch.☆238Updated 3 years ago
- Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…☆45Updated 5 years ago
- Knowledge Transfer via Distillation of Activation Boundaries Formed by Hidden Neurons (AAAI 2019)☆104Updated 5 years ago
- ☆165Updated 5 years ago
- ☆47Updated 3 years ago
- Pytorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition"☆98Updated 3 years ago
- Code for "Are labels necessary for neural architecture search"☆92Updated 7 months ago
- Implementation of the reversible residual network in pytorch☆101Updated 2 years ago
- A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).☆141Updated 4 years ago
- Filter Response Normalization tested on better ImageNet baselines.☆35Updated 4 years ago
- Code for SelfAugment☆27Updated 3 years ago
- Implementation of ICLR 2017 paper "Loss-aware Binarization of Deep Networks"☆18Updated 5 years ago
- (CVPR 2020) This repo contains code for "PADS: Policy-Adapted Sampling for Visual Similarity Learning", which proposes learnable triplet …☆60Updated 4 years ago
- Code for paper "SWALP: Stochastic Weight Averaging forLow-Precision Training".☆62Updated 5 years ago
- Pytorch implementation of SNAS☆75Updated 5 years ago
- custom cuda kernel for {2, 3}d relative attention with pytorch wrapper☆43Updated 4 years ago
- A second-order optimizer for deep networks☆24Updated 5 years ago
- Implementation and experiments for AdamW on Pytorch☆93Updated 4 years ago
- pytorch lmdb dataset with protobuf☆52Updated 5 years ago
- Code for SegTree Transformer (ICLR-RLGM 2019).☆27Updated 4 years ago
- tunz's CUDA pytorch operator (MaskedSoftmax)☆74Updated 5 years ago
- BlockDrop: Dynamic Inference Paths in Residual Networks☆140Updated last year
- Tensorflow implementation of "Representation Learning with Contrastive Predictive Coding"☆64Updated 5 years ago
- ☆63Updated 3 years ago