tensorflow / meshLinks
Mesh TensorFlow: Model Parallelism Made Easier
☆1,624Updated 2 years ago
Alternatives and similar repositories for mesh
Users that are interested in mesh are comparing it to the libraries listed below
Sorting:
- PyTorch extensions for high performance and large scale training.☆3,386Updated 7 months ago
- PyTorch elastic training☆729Updated 3 years ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,061Updated 2 years ago
- Make huge neural nets fit in memory☆2,823Updated 5 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,598Updated 5 years ago
- Lingvo☆2,854Updated 2 weeks ago
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO☆734Updated last week
- A performant and modular runtime for TensorFlow☆757Updated 2 months ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,709Updated this week
- A GPipe implementation in PyTorch☆857Updated last year
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,534Updated 4 months ago
- Collective communications library with various primitives for multi-machine training.☆1,370Updated last week
- Model analysis tools for TensorFlow☆1,268Updated 3 months ago
- Reference implementations of MLPerf® training benchmarks☆1,729Updated last week
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.☆1,006Updated last year
- Providing reproducibility in deep learning frameworks☆432Updated last year
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,487Updated this week
- Task-based datasets, preprocessing, and evaluation for sequence models.☆589Updated 2 weeks ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,067Updated last year
- TFX is an end-to-end platform for deploying production ML pipelines☆2,170Updated last month
- Making text a first-class citizen in TensorFlow.☆1,279Updated this week
- Fast Block Sparse Matrices for Pytorch☆550Updated 4 years ago
- Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"☆1,104Updated 4 years ago
- FastFormers - highly efficient transformer models for NLU☆707Updated 8 months ago
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.☆482Updated last week
- Useful extra functionality for TensorFlow 2.x maintained by SIG-addons☆1,706Updated 2 months ago
- A profiling and performance analysis tool for machine learning☆449Updated this week
- jiant is an nlp toolkit☆1,672Updated 2 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆791Updated 2 years ago
- A benchmark framework for Tensorflow☆1,147Updated 2 years ago