tensorflow / meshLinks
Mesh TensorFlow: Model Parallelism Made Easier
☆1,607Updated last year
Alternatives and similar repositories for mesh
Users that are interested in mesh are comparing it to the libraries listed below
Sorting:
- PyTorch extensions for high performance and large scale training.☆3,322Updated last month
- PyTorch elastic training☆729Updated 2 years ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,607Updated this week
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,040Updated last year
- Make huge neural nets fit in memory☆2,794Updated 5 years ago
- Lingvo☆2,839Updated last week
- A GPipe implementation in PyTorch☆842Updated 10 months ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,342Updated this week
- Long Range Arena for Benchmarking Efficient Transformers☆757Updated last year
- FastFormers - highly efficient transformer models for NLU☆705Updated 2 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆1,391Updated last year
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,568Updated last year
- Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training☆1,797Updated this week
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,572Updated 4 years ago
- Reference implementations of MLPerf™ training benchmarks☆1,673Updated 2 weeks ago
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.☆997Updated 10 months ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,046Updated last year
- A performant and modular runtime for TensorFlow☆761Updated last month
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da…☆1,084Updated 8 months ago
- ☆2,819Updated this week
- maximal update parametrization (µP)☆1,526Updated 10 months ago
- Task-based datasets, preprocessing, and evaluation for sequence models.☆574Updated 3 weeks ago
- JAX-based neural network library☆3,034Updated this week
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,839Updated last year
- Generate embeddings from large-scale graph-structured data.☆3,413Updated last year
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆472Updated 2 years ago
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO☆726Updated last month
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,353Updated last year
- Guide for building custom op for TensorFlow☆382Updated 2 years ago
- Making text a first-class citizen in TensorFlow.☆1,257Updated last week