tensorflow / meshLinks
Mesh TensorFlow: Model Parallelism Made Easier
☆1,613Updated last year
Alternatives and similar repositories for mesh
Users that are interested in mesh are comparing it to the libraries listed below
Sorting:
- PyTorch extensions for high performance and large scale training.☆3,350Updated 3 months ago
- PyTorch elastic training☆729Updated 3 years ago
- Make huge neural nets fit in memory☆2,803Updated 5 years ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,044Updated 2 years ago
- A GPipe implementation in PyTorch☆846Updated last year
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,528Updated 2 weeks ago
- Reference implementations of MLPerf™ training benchmarks☆1,696Updated last week
- FastFormers - highly efficient transformer models for NLU☆705Updated 4 months ago
- Collective communications library with various primitives for multi-machine training.☆1,332Updated this week
- A performant and modular runtime for TensorFlow☆758Updated 3 months ago
- Lingvo☆2,848Updated last month
- Model analysis tools for TensorFlow☆1,270Updated 2 weeks ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,647Updated this week
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.☆1,005Updated last year
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,415Updated this week
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.☆966Updated last week
- Providing reproducibility in deep learning frameworks☆428Updated last year
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO☆732Updated last month
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,056Updated last year
- Bagua Speeds up PyTorch☆882Updated last year
- Long Range Arena for Benchmarking Efficient Transformers☆762Updated last year
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…☆1,578Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,358Updated last year
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀☆1,688Updated 9 months ago
- Library for 8-bit optimizers and quantization routines.☆769Updated 2 years ago
- JAX-based neural network library☆3,068Updated this week
- jiant is an nlp toolkit☆1,670Updated 2 years ago
- Fast Block Sparse Matrices for Pytorch☆548Updated 4 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,583Updated 4 years ago
- Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"☆1,099Updated 4 years ago