pytorch / test-infraLinks
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic to track disabled tests and slow tests, as well as our continuation integration jobs HUD/dashboard.
☆97Updated this week
Alternatives and similar repositories for test-infra
Users that are interested in test-infra are comparing it to the libraries listed below
Sorting:
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆158Updated 2 months ago
- PyTorch RFCs (experimental)☆133Updated 2 months ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆382Updated this week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆288Updated last week
- A library to analyze PyTorch traces.☆404Updated last week
- ☆324Updated 3 weeks ago
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆348Updated this week
- TorchFix - a linter for PyTorch-using code with autofix support☆144Updated 6 months ago
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.☆208Updated last week
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆260Updated this week
- ☆171Updated last year
- TORCH_LOGS parser for PT2☆55Updated this week
- ☆143Updated 2 weeks ago
- Provide Python access to the NVML library for GPU diagnostics☆245Updated 8 months ago
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆206Updated this week
- This repository contains the experimental PyTorch native float8 training UX☆224Updated last year
- extensible collectives library in triton☆88Updated 4 months ago
- jax-triton contains integrations between JAX and OpenAI Triton☆413Updated 2 months ago
- Torch Distributed Experimental☆117Updated last year
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆383Updated last week
- ☆251Updated last year
- JAX-Toolbox☆329Updated last week
- MLPerf™ logging library☆36Updated this week
- A tensor-aware point-to-point communication primitive for machine learning☆262Updated last week
- MLIR-based partitioning system☆120Updated last week
- oneCCL Bindings for Pytorch*☆100Updated 2 weeks ago
- Applied AI experiments and examples for PyTorch☆290Updated 2 months ago
- Implementation of a Transformer, but completely in Triton☆273Updated 3 years ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆192Updated last week