pytorch / test-infra
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic to track disabled tests and slow tests, as well as the HUD/dashboard for our continuous integration jobs.
☆102Updated this week
Alternatives and similar repositories for test-infra
Users who are interested in test-infra are comparing it to the libraries listed below.
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆161Updated last month
- PyTorch RFCs (experimental)☆135Updated 4 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆301Updated 2 weeks ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆395Updated last week
- A library to analyze PyTorch traces.☆416Updated last week
- ☆178Updated last year
- TorchFix - a linter for PyTorch-using code with autofix support☆149Updated 2 months ago
- Home for OctoML PyTorch Profiler☆114Updated 2 years ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆357Updated last week
- Provide Python access to the NVML library for GPU diagnostics☆248Updated last month
- Torch Distributed Experimental☆117Updated last year
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆389Updated last week
- ☆335Updated last month
- ☆145Updated last week
- This repository contains the experimental PyTorch native float8 training UX☆223Updated last year
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆122Updated last month
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆420Updated last week
- A tensor-aware point-to-point communication primitive for machine learning☆273Updated 2 months ago
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆180Updated last month
- MLPerf™ logging library☆37Updated last week
- TORCH_LOGS parser for PT2☆62Updated last month
- PyTorch interface for the IPU☆181Updated 2 years ago
- The Triton backend for the PyTorch TorchScript models.☆160Updated last week
- extensible collectives library in triton☆89Updated 6 months ago
- Implementation of a Transformer, but completely in Triton☆276Updated 3 years ago
- jax-triton contains integrations between JAX and OpenAI Triton☆428Updated last week
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆226Updated last week
- ☆28Updated 3 months ago
- ☆190Updated 3 weeks ago
- JAX-Toolbox☆355Updated this week