ShigekiKarita / pytorch-distributed-slurm-example
☆42Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for pytorch-distributed-slurm-example
- ☆61Updated 4 years ago
- A second-order optimizer for deep networks☆24Updated 5 years ago
- Distributed, mixed-precision training with PyTorch☆89Updated 4 years ago
- pytorch lmdb dataset with protobuf☆52Updated 5 years ago
- custom cuda kernel for {2, 3}d relative attention with pytorch wrapper☆43Updated 4 years ago
- Efficient DataLoader for PyTorch and Keras for loading datasets from web servers and object stores.☆29Updated 5 years ago
- tunz's CUDA pytorch operator (MaskedSoftmax)☆75Updated 5 years ago
- Xuhong Li, Yves Grandvalet, and Franck Davoine. "Explicit Inductive Bias for Transfer Learning with Convolutional Networks." In ICML 2018…☆55Updated 6 years ago
- Implementation of the reversible residual network in pytorch☆101Updated 2 years ago
- An implementation of shampoo☆74Updated 6 years ago
- Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…☆45Updated 5 years ago
- ☆165Updated 5 years ago
- Pytorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition"☆98Updated 3 years ago
- An unofficial PyTorch implementation of the HAN and AdaHAN models presented in the "Learning Visual Question Answering by Bootstrapping H…☆54Updated 6 years ago
- Official code for "Writing Distributed Applications with PyTorch", PyTorch Tutorial☆255Updated last year
- Code for "Are labels necessary for neural architecture search"☆92Updated 8 months ago
- Implementation and experiments for AdamW on Pytorch☆93Updated 5 years ago
- Example of using PyTorch DistributedDataParallel and SLURM on skynet☆30Updated 3 years ago
- "Layer-wise Adaptive Rate Scaling" in PyTorch☆86Updated 3 years ago
- Filter Response Normalization tested on better ImageNet baselines.☆35Updated 4 years ago
- Profile the GPU memory usage of every line in a Pytorch code☆82Updated 6 years ago
- ☆47Updated 3 years ago
- PyTorch Implementations of Dropout Variants☆87Updated 6 years ago
- Adaptive Softmax implementation for PyTorch☆79Updated 5 years ago
- (Batched) advanced indexing for PyTorch.☆53Updated 11 months ago
- Utilities for Pytorch☆89Updated 2 years ago
- A PyTorch implementation of shake-shake☆111Updated 4 years ago
- Code for SelfAugment☆27Updated 3 years ago
- [ACL 2019] Visually Grounded Neural Syntax Acquisition☆89Updated 8 months ago