tensorflow / meshLinks

Mesh TensorFlow: Model Parallelism Made Easier

☆1,624

Alternatives and similar repositories for mesh

Users that are interested in mesh are comparing it to the libraries listed below

Sorting:

facebookresearch / fairscale
PyTorch extensions for high performance and large scale training.
☆3,386Updated 7 months ago
pytorch / elastic
PyTorch elastic training
☆729Updated 3 years ago
openai / blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
☆1,061Updated 2 years ago
cybertronai / gradient-checkpointing
Make huge neural nets fit in memory
☆2,823Updated 5 years ago
openai / sparse_attention
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
☆1,598Updated 5 years ago
tensorflow / lingvo
Lingvo
☆2,854Updated 2 weeks ago
tensorflow / io
Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO
☆734Updated last week
tensorflow / runtime
A performant and modular runtime for TensorFlow
☆757Updated 2 months ago
pytorch / xla
Enabling PyTorch on XLA Devices (e.g. Google TPU)
☆2,709Updated this week
kakaobrain / torchgpipe
A GPipe implementation in PyTorch
☆857Updated last year
Tencent / TurboTransformers
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.
☆1,534Updated 4 months ago
pytorch / gloo
Collective communications library with various primitives for multi-machine training.
☆1,370Updated last week
tensorflow / model-analysis
Model analysis tools for TensorFlow
☆1,268Updated 3 months ago
mlcommons / training
Reference implementations of MLPerf® training benchmarks
☆1,729Updated last week
bigscience-workshop / bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
☆1,006Updated last year
NVIDIA / framework-reproducibility
Providing reproducibility in deep learning frameworks
☆432Updated last year
pytorch / FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
☆1,487Updated this week
google / seqio
Task-based datasets, preprocessing, and evaluation for sequence models.
☆589Updated 2 weeks ago
pytorch / torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,067Updated last year
tensorflow / tfx
TFX is an end-to-end platform for deploying production ML pipelines
☆2,170Updated last month
tensorflow / text
Making text a first-class citizen in TensorFlow.
☆1,279Updated this week
huggingface / pytorch_block_sparse
Fast Block Sparse Matrices for Pytorch
☆550Updated 4 years ago
keroro824 / HashingDeepLearning
Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"
☆1,104Updated 4 years ago
microsoft / fastformers
FastFormers - highly efficient transformer models for NLU
☆707Updated 8 months ago
microsoft / archai
Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.
☆482Updated last week
tensorflow / addons
Useful extra functionality for TensorFlow 2.x maintained by SIG-addons
☆1,706Updated 2 months ago
openxla / xprof
A profiling and performance analysis tool for machine learning
☆449Updated this week
nyu-mll / jiant
jiant is an nlp toolkit
☆1,672Updated 2 years ago
tunib-ai / parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
☆791Updated 2 years ago
tensorflow / benchmarks
A benchmark framework for Tensorflow
☆1,147Updated 2 years ago