tensorflow / mesh
Mesh TensorFlow: Model Parallelism Made Easier
☆1,598 Updated last year
Alternatives and similar repositories for mesh:
Users who are interested in mesh are comparing it to the libraries listed below.
- PyTorch extensions for high performance and large scale training. ☆3,232 Updated this week
- Make huge neural nets fit in memory ☆2,745 Updated 4 years ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution ☆1,030 Updated last year
- A GPipe implementation in PyTorch ☆821 Updated 5 months ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers" ☆1,539 Updated 4 years ago
- PyTorch elastic training ☆730 Updated 2 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,240 Updated this week
- Useful extra functionality for TensorFlow 2.x maintained by SIG-addons ☆1,693 Updated 4 months ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,016 Updated 9 months ago
- FastFormers - highly efficient transformer models for NLU ☆703 Updated last year
- PyTorch domain library for recommendation systems ☆2,007 Updated this week
- Code and model for the paper "Improving Language Understanding by Generative Pre-Training" ☆2,176 Updated 5 years ago
- JAX-based neural network library ☆2,939 Updated last month
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,345 Updated 9 months ago
- Enabling PyTorch on XLA Devices (e.g. Google TPU) ☆2,511 Updated this week
- A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU. ☆1,503 Updated last year
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer" ☆6,240 Updated 3 months ago
- Collective communications library with various primitives for multi-machine training. ☆1,253 Updated 2 weeks ago
- NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale da… ☆1,063 Updated 4 months ago
- Fast Block Sparse Matrices for PyTorch ☆547 Updated 3 years ago
- Reference implementations of MLPerf™ training benchmarks ☆1,636 Updated this week
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀 ☆1,671 Updated 2 months ago
- jiant is an NLP toolkit ☆1,657 Updated last year
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic … ☆3,510 Updated 3 weeks ago
- ☆2,723 Updated this week
- Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models. ☆1,135 Updated 10 months ago
- A performant and modular runtime for TensorFlow ☆759 Updated last month
- Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training ☆1,749 Updated this week
- Longformer: The Long-Document Transformer ☆2,072 Updated last year
- Generate embeddings from large-scale graph-structured data. ☆3,394 Updated 10 months ago