tensorflow / mesh
Mesh TensorFlow: Model Parallelism Made Easier
☆1,615 Updated last year
Alternatives and similar repositories for mesh
Users who are interested in mesh are comparing it to the libraries listed below.
- PyTorch extensions for high performance and large scale training. ☆3,376 Updated 5 months ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution ☆1,053 Updated 2 years ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU) ☆2,688 Updated this week
- PyTorch elastic training ☆730 Updated 3 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers" ☆1,588 Updated 5 years ago
- Make huge neural nets fit in memory ☆2,814 Updated 5 years ago
- Reference implementations of MLPerf® training benchmarks ☆1,715 Updated last week
- A GPipe implementation in PyTorch ☆855 Updated last year
- A performant and modular runtime for TensorFlow ☆760 Updated 3 weeks ago
- Useful extra functionality for TensorFlow 2.x maintained by SIG-addons ☆1,707 Updated 3 weeks ago
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data. ☆1,006 Updated last year
- Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO ☆731 Updated 3 weeks ago
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab… ☆1,584 Updated last year
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU. ☆1,530 Updated 2 months ago
- Providing reproducibility in deep learning frameworks ☆428 Updated last year
- Lingvo ☆2,850 Updated this week
- Collective communications library with various primitives for multi-machine training. ☆1,356 Updated 2 weeks ago
- Long Range Arena for Benchmarking Efficient Transformers ☆764 Updated last year
- jiant is an nlp toolkit ☆1,670 Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆1,422 Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,362 Updated last year
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,063 Updated last year
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆587 Updated 2 weeks ago
- A benchmark framework for Tensorflow ☆1,148 Updated last year
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,444 Updated this week
- Model analysis tools for TensorFlow ☆1,267 Updated last month
- Making text a first-class citizen in TensorFlow. ☆1,273 Updated this week
- maximal update parametrization (µP) ☆1,605 Updated last year
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment ☆790 Updated 2 years ago
- ☆2,889 Updated this week