pytorch / test-infra
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic to track disabled tests and slow tests, as well as our continuous integration jobs HUD/dashboard.
☆103Updated this week
Alternatives and similar repositories for test-infra
Users that are interested in test-infra are comparing it to the libraries listed below
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆161Updated last month
- PyTorch RFCs (experimental)☆135Updated 5 months ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆399Updated last week
- A library to analyze PyTorch traces.☆426Updated last week
- Home for OctoML PyTorch Profiler☆114Updated 2 years ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆306Updated 2 weeks ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆360Updated this week
- TorchFix - a linter for PyTorch-using code with autofix support☆148Updated 2 months ago
- Provide Python access to the NVML library for GPU diagnostics☆249Updated 2 months ago
- Torch Distributed Experimental☆117Updated last year
- ☆337Updated last week
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆232Updated this week
- ☆181Updated last year
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.☆216Updated last week
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.☆885Updated last week
- ☆145Updated last week
- ☆252Updated last year
- A tensor-aware point-to-point communication primitive for machine learning☆275Updated last week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆199Updated last week
- This repository contains the experimental PyTorch native float8 training UX☆223Updated last year
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆446Updated this week
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆181Updated 2 months ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆585Updated this week
- Implementation of a Transformer, but completely in Triton☆276Updated 3 years ago
- TORCH_LOGS parser for PT2☆64Updated this week
- Stores documents and resources used by the OpenXLA developer community☆131Updated last year
- extensible collectives library in triton☆90Updated 7 months ago
- Applied AI experiments and examples for PyTorch☆303Updated 2 months ago
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆65Updated 4 months ago
- ☆190Updated 2 weeks ago