jansel / pytorch-jit-paritybench
☆36Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for pytorch-jit-paritybench
- ☆48Updated 8 months ago
- System for automated integration of deep learning backends.☆48Updated 2 years ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆26Updated last year
- Benchmarks to capture important workloads.☆28Updated 5 months ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆63Updated 2 years ago
- ☆140Updated last year
- GEMM and Winograd based convolutions using CUTLASS☆25Updated 4 years ago
- Memory Optimizations for Deep Learning (ICML 2023)☆60Updated 8 months ago
- PyTorch RFCs (experimental)☆130Updated 2 months ago
- Codebase associated with the PyTorch compiler tutorial☆44Updated 5 years ago
- Python bindings for NVTX☆66Updated last year
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆37Updated 9 months ago
- ☆149Updated 5 months ago
- extensible collectives library in triton☆72Updated 2 months ago
- A library of GPU kernels for sparse matrix operations.☆249Updated 4 years ago
- oneCCL Bindings for Pytorch*☆86Updated 3 weeks ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆114Updated 2 years ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆271Updated this week
- Training neural networks in TensorFlow 2.0 with 5x less memory☆129Updated 2 years ago
- Home for OctoML PyTorch Profiler☆107Updated last year
- Prototype routines for GPU quantization written using PyTorch.☆19Updated 2 weeks ago
- Benchmark scripts for TVM☆73Updated 2 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆131Updated last year
- ☆12Updated 3 years ago
- Repository for SysML19 Artifacts Evaluation☆53Updated 5 years ago
- ☆55Updated 6 months ago
- MLIR-based partitioning system☆42Updated this week
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆146Updated this week
- Experimental projects related to TensorRT☆81Updated this week
- End to End steps for adding custom ops in PyTorch.☆19Updated 4 years ago