tensorflow / mesh
Mesh TensorFlow: Model Parallelism Made Easier
☆1,608 Updated last year
Alternatives and similar repositories for mesh
Users who are interested in mesh are comparing it to the libraries listed below.
- PyTorch extensions for high performance and large scale training. ☆3,331 Updated last month
- PyTorch elastic training ☆728 Updated 3 years ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution ☆1,041 Updated 2 years ago
- A GPipe implementation in PyTorch ☆843 Updated 10 months ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU) ☆2,623 Updated this week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,051 Updated last year
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab… ☆1,571 Updated last year
- Collective communications library with various primitives for multi-machine training. ☆1,315 Updated this week
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀 ☆1,688 Updated 7 months ago
- FastFormers - highly efficient transformer models for NLU ☆705 Updated 3 months ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,912 Updated 2 years ago
- Make huge neural nets fit in memory ☆2,797 Updated 5 years ago
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU. ☆1,526 Updated 2 months ago
- functorch is JAX-like composable function transforms for PyTorch. ☆1,432 Updated this week
- jiant is an nlp toolkit ☆1,669 Updated last year
- Training and serving large-scale neural networks with auto parallelization. ☆3,138 Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,356 Updated last year
- ☆2,833 Updated 2 weeks ago
- Lingvo ☆2,843 Updated this week
- Language-Agnostic SEntence Representations ☆3,644 Updated last year
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data. ☆1,000 Updated 10 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆1,395 Updated last year
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,383 Updated this week
- The implementation of DeBERTa ☆2,101 Updated last year
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆576 Updated last month
- Library for 8-bit optimizers and quantization routines. ☆716 Updated 2 years ago
- Repository for the paper "Optimal Subarchitecture Extraction for BERT" ☆473 Updated 3 years ago
- Fast Block Sparse Matrices for Pytorch ☆547 Updated 4 years ago
- Making text a first-class citizen in TensorFlow. ☆1,257 Updated this week
- Longformer: The Long-Document Transformer ☆2,134 Updated 2 years ago