pytorch / test-infraLinks
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic to track disabled tests and slow tests, as well as our continuation integration jobs HUD/dashboard.
☆96Updated this week
Alternatives and similar repositories for test-infra
Users that are interested in test-infra are comparing it to the libraries listed below
Sorting:
- PyTorch RFCs (experimental)☆133Updated 2 months ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆158Updated last month
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆378Updated this week
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆282Updated 3 weeks ago
- A library to analyze PyTorch traces.☆400Updated this week
- ☆323Updated last month
- TorchFix - a linter for PyTorch-using code with autofix support☆145Updated 5 months ago
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.☆206Updated this week
- Torch Distributed Experimental☆117Updated 11 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆345Updated this week
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆196Updated this week
- ☆142Updated 2 weeks ago
- ☆171Updated last year
- Implementation of a Transformer, but completely in Triton☆273Updated 3 years ago
- This repository contains the experimental PyTorch native float8 training UX☆224Updated last year
- ☆251Updated last year
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆180Updated 3 weeks ago
- JAX-Toolbox☆327Updated this week
- A tensor-aware point-to-point communication primitive for machine learning☆259Updated 2 years ago
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆372Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆191Updated this week
- Provide Python access to the NVML library for GPU diagnostics☆242Updated 8 months ago
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.☆842Updated last week
- jax-triton contains integrations between JAX and OpenAI Triton☆411Updated last month
- TORCH_LOGS parser for PT2☆47Updated last week
- The Triton backend for the PyTorch TorchScript models.☆157Updated last week
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆364Updated last month
- ☆187Updated this week
- Distributed preprocessing and data loading for language datasets☆39Updated last year