tensorflow / mesh
Mesh TensorFlow: Model Parallelism Made Easier
☆1,605 Updated last year
Alternatives and similar repositories for mesh:
Users who are interested in mesh are comparing it to the libraries listed below.
- PyTorch extensions for high performance and large scale training. ☆3,293 Updated this week
- PyTorch elastic training ☆730 Updated 2 years ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution ☆1,038 Updated last year
- A GPipe implementation in PyTorch ☆835 Updated 8 months ago
- FastFormers - highly efficient transformer models for NLU ☆705 Updated 3 weeks ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU) ☆2,582 Updated this week
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab… ☆1,564 Updated last year
- Make huge neural nets fit in memory ☆2,781 Updated 4 years ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da… ☆1,078 Updated 7 months ago
- A performant and modular runtime for TensorFlow ☆759 Updated last month
- Collective communications library with various primitives for multi-machine training. ☆1,288 Updated this week
- Training and serving large-scale neural networks with auto parallelization. ☆3,122 Updated last year
- A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT2, Decoders, etc.) on CPU and GPU. ☆1,518 Updated last week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,039 Updated 11 months ago
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data. ☆990 Updated 8 months ago
- ☆1,552 Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆1,382 Updated last year
- MII makes low-latency and high-throughput inference possible, powered by DeepSpeed. ☆1,999 Updated 2 weeks ago
- jiant is an NLP toolkit ☆1,666 Updated last year
- ☆2,782 Updated this week
- Lingvo ☆2,837 Updated last month
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f… ☆1,828 Updated last year
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO ☆723 Updated this week
- JAX-based neural network library ☆3,010 Updated this week
- Longformer: The Long-Document Transformer ☆2,104 Updated 2 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,350 Updated last year
- Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems" ☆1,095 Updated 4 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers" ☆1,566 Updated 4 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,293 Updated this week
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀 ☆1,683 Updated 5 months ago