facebookresearch / fairscale

PyTorch extensions for high performance and large scale training.

☆3,350

Alternatives and similar repositories for fairscale

Users who are interested in fairscale are comparing it to the libraries listed below.

webdataset / webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
☆2,736 Updated last month
NVIDIA / TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla…
☆2,587 Updated this week
tensorflow / mesh
Mesh TensorFlow: Model Parallelism Made Easier
☆1,613 Updated last year
pytorch / data
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
☆1,213 Updated this week
NVIDIA / FasterTransformer
Transformer related optimization, including BERT, GPT
☆6,261 Updated last year
microsoft / mup
maximal update parametrization (µP)
☆1,569 Updated last year
ELS-RD / kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab…
☆1,578 Updated last year
huggingface / accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…
☆8,971 Updated last week
microsoft / torchscale
Foundation Architecture for (M)LLMs
☆3,097 Updated last year
pytorch / benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
☆966 Updated this week
Lightning-AI / torchmetrics
Machine learning metrics for distributed, scalable PyTorch applications.
☆2,316 Updated last week
laekov / fastmoe
A fast MoE impl for PyTorch
☆1,766 Updated 5 months ago
pytorch / torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,056 Updated last year
pytorch / xla
Enabling PyTorch on XLA Devices (e.g. Google TPU)
☆2,647 Updated this week
deepspeedai / Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆2,123 Updated 2 weeks ago
deepspeedai / DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
☆2,042 Updated last month
bigscience-workshop / Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆1,405 Updated last year
bigscience-workshop / bigscience
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
☆1,005 Updated last year
pytorch / TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
☆2,818 Updated this week
idiap / fast-transformers
PyTorch library for fast transformer implementations
☆1,725 Updated 2 years ago
ELS-RD / transformer-deploy
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀
☆1,688 Updated 9 months ago
libffcv / ffcv
FFCV: Fast Forward Computer Vision (and other ML workloads!)
☆2,952 Updated last year
google / flax
Flax is a neural network library for JAX that is designed for flexibility.
☆6,712 Updated this week
bitsandbytes-foundation / bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
☆7,400 Updated last week
Lightning-Universe / lightning-bolts
Toolbox of models, callbacks, and datasets for AI/ML researchers.
☆1,734 Updated 3 weeks ago
facebookresearch / xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
☆9,788 Updated this week
arogozhnikov / einops
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
☆9,077 Updated last month
cybertronai / gradient-checkpointing
Make huge neural nets fit in memory
☆2,803 Updated 5 years ago
alpa-projects / alpa
Training and serving large-scale neural networks with auto parallelization.
☆3,143 Updated last year
pytorch / functorch
functorch is JAX-like composable function transforms for PyTorch.
☆1,434 Updated this week