pytorch / test-infraLinks
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic to track disabled tests and slow tests, as well as our continuation integration jobs HUD/dashboard.
☆105Updated this week
Alternatives and similar repositories for test-infra
Users that are interested in test-infra are comparing it to the libraries listed below
Sorting:
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆164Updated 3 weeks ago
- PyTorch RFCs (experimental)☆138Updated 8 months ago
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆412Updated this week
- ☆152Updated last month
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆329Updated this week
- ☆345Updated last week
- A library to analyze PyTorch traces.☆462Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆380Updated this week
- TorchFix - a linter for PyTorch-using code with autofix support☆152Updated 5 months ago
- ☆189Updated last year
- TORCH_TRACE parser for PT2☆76Updated this week
- ☆192Updated last week
- jax-triton contains integrations between JAX and OpenAI Triton☆437Updated last month
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆404Updated last month
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆262Updated this week
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- A tensor-aware point-to-point communication primitive for machine learning☆283Updated last month
- JAX-Toolbox☆382Updated this week
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆547Updated 3 weeks ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆739Updated this week
- Implementation of a Transformer, but completely in Triton☆279Updated 3 years ago
- This repository contains the experimental PyTorch native float8 training UX☆227Updated last year
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆475Updated last week
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.☆921Updated this week
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆130Updated last week
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆79Updated last month
- Tokamax: A GPU and TPU kernel library.☆170Updated this week
- An experimental implementation of compiler-driven automatic sharding of models across a given device mesh.☆52Updated this week
- ☆73Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆205Updated last week