davidrpugh / horovod-gpu-data-science-project
Template repository for a Python 3-based data science project that uses Horovod.
☆43Updated 3 years ago
Alternatives and similar repositories for horovod-gpu-data-science-project:
Users that are interested in horovod-gpu-data-science-project are comparing it to the libraries listed below
- A collection of optimizers, some arcane others well known, for Flax.☆29Updated 3 years ago
- ☆36Updated 3 years ago
- Code for the anonymous submission "Cockpit: A Practical Debugging Tool for Training Deep Neural Networks"☆31Updated 4 years ago
- Companion code for a tutorial on using Hydra.☆29Updated 3 years ago
- Torch Distributed Experimental☆115Updated 7 months ago
- ☆25Updated 3 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆103Updated 3 years ago
- Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention (CVPR 2022)☆20Updated 2 years ago
- A Machine Learning workflow for Slurm.☆149Updated 4 years ago
- Collection of snippets for PyTorch users☆25Updated 3 years ago
- High performance pytorch modules☆18Updated 2 years ago
- [ICML 2020] code for "PowerNorm: Rethinking Batch Normalization in Transformers" https://arxiv.org/abs/2003.07845☆119Updated 3 years ago
- NeurIPS 2021 - Few-shot learning competition☆26Updated 3 years ago
- ☆22Updated 2 years ago
- ☆47Updated 4 years ago
- Minimal Reproducibility Study of (https://arxiv.org/abs/1911.05248). Experiments with Compression of Deep Neural Networks☆9Updated 3 years ago
- a lightweight transformer library for PyTorch☆71Updated 3 years ago
- ☆73Updated 2 years ago
- Code release to reproduce ASHA experiments from "Random Search and Reproducibility for NAS."☆22Updated 5 years ago
- 👑 Pytorch code for the Nero optimiser.☆20Updated 2 years ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆31Updated 4 years ago
- [CogSci'21] Study of human inductive biases in CNNs and Transformers.☆43Updated 3 years ago
- Tensorboard extension for Jupyterlab all in one☆89Updated 7 months ago
- Simply Numpy implementation of the FAVOR+ attention mechanism, https://teddykoker.com/2020/11/performers/☆37Updated 4 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆33Updated 4 years ago
- PyTorch implementation of HashedNets☆36Updated last year
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 2 years ago
- ☆15Updated 4 years ago
- 👩 Pytorch and Jax code for the Madam optimiser.☆51Updated 4 years ago
- A selection of neural network models ported from torchvision for JAX & Flax.☆44Updated 4 years ago