albanD / pytorch_dev_env_setup
☆10Updated 6 months ago
Alternatives and similar repositories for pytorch_dev_env_setup:
Users that are interested in pytorch_dev_env_setup are comparing it to the libraries listed below
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆294Updated this week
- Convert nvprof profiles into about:tracing compatible JSON files☆68Updated 3 years ago
- Ahead of Time (AOT) Triton Math Library☆50Updated this week
- ☆48Updated 10 months ago
- Stores documents and resources used by the OpenXLA developer community☆114Updated 5 months ago
- extensible collectives library in triton☆77Updated 4 months ago
- ☆158Updated 7 months ago
- ☆36Updated last month
- Fastest kernels written from scratch☆131Updated 2 months ago
- MLIR-based partitioning system☆58Updated this week
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆26Updated last year
- Experimental projects related to TensorRT☆86Updated this week
- A library to analyze PyTorch traces.☆325Updated this week
- ☆64Updated 2 months ago
- Home for OctoML PyTorch Profiler☆107Updated last year
- CUTLASS and CuTe Examples☆37Updated 3 weeks ago
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆99Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆127Updated last year
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆86Updated last week
- ☆279Updated last week
- An experimental CPU backend for Triton☆81Updated last week
- ☆15Updated 4 months ago
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆106Updated last year
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆37Updated 6 months ago
- Shared Middle-Layer for Triton Compilation☆220Updated this week
- Applied AI experiments and examples for PyTorch☆216Updated last week
- IREE's PyTorch Frontend, based on Torch Dynamo.☆62Updated this week
- Learning about CUDA by writing PTX code.☆33Updated 11 months ago
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆84Updated this week
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆178Updated last month