tensorflow / mesh
Mesh TensorFlow: Model Parallelism Made Easier
☆1,605 Updated last year
Alternatives and similar repositories for mesh:
Users who are interested in mesh are comparing it to the libraries listed below.
- PyTorch extensions for high performance and large scale training. ☆3,308 Updated last week
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU. ☆1,520 Updated 3 weeks ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,353 Updated last year
- Make huge neural nets fit in memory ☆2,786 Updated 5 years ago
- FastFormers - highly efficient transformer models for NLU ☆706 Updated last month
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data. ☆993 Updated 9 months ago
- A GPipe implementation in PyTorch ☆837 Updated 9 months ago
- ☆2,800 Updated this week
- Efficient GPU kernels for block-sparse matrix multiplication and convolution ☆1,039 Updated last year
- PyTorch elastic training ☆730 Updated 2 years ago
- Long Range Arena for Benchmarking Efficient Transformers ☆751 Updated last year
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆574 Updated this week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆1,386 Updated last year
- Enabling PyTorch on XLA Devices (e.g. Google TPU) ☆2,596 Updated this week
- Pytorch library for fast transformer implementations ☆1,701 Updated 2 years ago
- Fast Block Sparse Matrices for Pytorch ☆545 Updated 4 years ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀 ☆1,683 Updated 6 months ago
- Reference implementations of MLPerf™ training benchmarks ☆1,664 Updated 3 weeks ago
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab… ☆1,566 Updated last year
- Lingvo ☆2,838 Updated this week
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers" ☆1,569 Updated 4 years ago
- JAX-based neural network library ☆3,021 Updated this week
- ☆1,560 Updated 2 years ago
- Fast and Easy Infinite Neural Networks in Python ☆2,335 Updated last year
- functorch is JAX-like composable function transforms for PyTorch. ☆1,424 Updated this week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,040 Updated last year
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,313 Updated this week
- maximal update parametrization (µP) ☆1,500 Updated 9 months ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT" ☆472 Updated 2 years ago
- Longformer: The Long-Document Transformer ☆2,118 Updated 2 years ago