facebookresearch / fairscale
PyTorch extensions for high performance and large scale training.
☆3,195 · Updated last week
Related projects
Alternatives and complementary repositories for fairscale
- A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆7,958 · Updated this week
- Transformer related optimization, including BERT, GPT ☆5,890 · Updated 7 months ago
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch. ☆2,328 · Updated last month
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs… ☆1,979 · Updated this week
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackab… ☆1,535 · Updated 9 months ago
- Accessible large language models via k-bit quantization for PyTorch. ☆6,299 · Updated this week
- Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools ☆2,576 · Updated this week
- A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries. ☆1,134 · Updated this week
- Ongoing research training transformer models at scale ☆10,595 · Updated this week
- MII makes low-latency and high-throughput inference possible, powered by DeepSpeed. ☆1,904 · Updated this week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆1,893 · Updated last month
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,011 · Updated 7 months ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch ☆8,415 · Updated 2 weeks ago
- A Unified Library for Parameter-Efficient and Modular Transfer Learning ☆2,581 · Updated 2 weeks ago
- Serve, optimize and scale PyTorch models in production ☆4,218 · Updated 3 weeks ago
- Mesh TensorFlow: Model Parallelism Made Easier ☆1,591 · Updated last year
- Machine learning metrics for distributed, scalable PyTorch applications. ☆2,137 · Updated this week
- 🤗 Evaluate: A library for easily evaluating machine learning models and datasets. ☆2,037 · Updated 2 months ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models ☆1,659 · Updated 3 weeks ago
- maximal update parametrization (µP) ☆1,402 · Updated 4 months ago
- Foundation Architecture for (M)LLMs ☆3,034 · Updated 7 months ago
- A concise but complete full-attention transformer with a set of promising experimental features from various papers ☆4,793 · Updated this week
- Enabling PyTorch on XLA Devices (e.g. Google TPU) ☆2,489 · Updated this week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆1,338 · Updated 8 months ago
- Flax is a neural network library for JAX that is designed for flexibility. ☆6,142 · Updated this week
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,597 · Updated this week
- A GPipe implementation in PyTorch ☆818 · Updated 3 months ago
- FFCV: Fast Forward Computer Vision (and other ML workloads!) ☆2,867 · Updated 5 months ago
- FlexFlow Serve: Low-Latency, High-Performance LLM Serving ☆1,713 · Updated this week