tensorflow / mesh
Mesh TensorFlow: Model Parallelism Made Easier
☆1,602Updated last year
Alternatives and similar repositories for mesh:
Users that are interested in mesh are comparing it to the libraries listed below
- PyTorch extensions for high performance and large scale training.☆3,278Updated 2 months ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,547Updated this week
- PyTorch elastic training☆730Updated 2 years ago
- Make huge neural nets fit in memory☆2,771Updated 4 years ago
- A GPipe implementation in PyTorch☆836Updated 7 months ago
- JAX-based neural network library☆2,990Updated last week
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,560Updated 4 years ago
- Lingvo☆2,832Updated this week
- FastFormers - highly efficient transformer models for NLU☆704Updated last year
- ☆1,545Updated last year
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,039Updated last year
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,680Updated 4 months ago
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,559Updated last year
- Fast Block Sparse Matrices for Pytorch☆546Updated 4 years ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,071Updated 6 months ago
- Training and serving large-scale neural networks with auto parallelization.☆3,114Updated last year
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,112Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,378Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,350Updated 11 months ago
- Longformer: The Long-Document Transformer☆2,092Updated 2 years ago
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.☆988Updated 7 months ago
- Collective communications library with various primitives for multi-machine training.☆1,277Updated this week
- maximal update parametrization (µP)☆1,480Updated 8 months ago
- Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training☆1,773Updated this week
- Flax is a neural network library for JAX that is designed for flexibility.☆6,421Updated this week
- Task-based datasets, preprocessing, and evaluation for sequence models.☆571Updated 3 weeks ago
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,514Updated last year
- functorch is JAX-like composable function transforms for PyTorch.☆1,414Updated this week
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training☆978Updated last week
- Papers & presentation materials from Hugging Face's internal science day☆2,043Updated 4 years ago