pytorch / xla
Enabling PyTorch on XLA Devices (e.g. Google TPU)
☆2,489Updated this week
Related projects ⓘ
Alternatives and complementary repositories for xla
- PyTorch extensions for high performance and large scale training.☆3,195Updated last week
- Flax is a neural network library for JAX that is designed for flexibility.☆6,142Updated this week
- Make huge neural nets fit in memory☆2,730Updated 4 years ago
- JAX-based neural network library☆2,909Updated last week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,415Updated 2 weeks ago
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,158Updated this week
- High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.☆4,528Updated last week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster.☆1,011Updated 7 months ago
- functorch is JAX-like composable function transforms for PyTorch.☆1,396Updated this week
- Mesh TensorFlow: Model Parallelism Made Easier☆1,591Updated last year
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,597Updated this week
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,210Updated this week
- A GPipe implementation in PyTorch☆818Updated 3 months ago
- Compiler for Neural Network hardware accelerators☆3,236Updated 6 months ago
- Serve, optimize and scale PyTorch models in production☆4,218Updated 3 weeks ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,043Updated 7 months ago
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs…☆1,979Updated this week
- Optimized primitives for collective multi-GPU communication☆3,253Updated 2 months ago
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆2,710Updated this week
- Collective communications library with various primitives for multi-machine training.☆1,227Updated this week
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.☆875Updated this week
- PyTorch elastic training☆730Updated 2 years ago
- A domain specific language to express machine learning workloads.☆1,761Updated last year
- Profiling and inspecting memory in pytorch☆1,020Updated 3 months ago
- oneAPI Deep Neural Network Library (oneDNN)☆3,635Updated this week
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆8,524Updated this week
- C++ extensions in PyTorch☆1,018Updated 3 months ago
- Toolbox of models, callbacks, and datasets for AI/ML researchers.☆1,695Updated 2 weeks ago
- Reference implementations of MLPerf™ training benchmarks☆1,617Updated last month
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,027Updated last year