mlcommons / logging
MLPerf™ logging library
☆32 Updated this week
Alternatives and similar repositories for logging:
Users that are interested in logging are comparing it to the libraries listed below
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large … ☆64 Updated 2 years ago
- ☆59 Updated last week
- An IR for efficiently simulating distributed ML computation. ☆27 Updated last year
- Benchmarks to capture important workloads. ☆29 Updated 2 weeks ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark. ☆38 Updated 11 months ago
- TileFusion is a highly efficient kernel template library designed to elevate the level of abstraction in CUDA C for processing tiles. ☆53 Updated this week
- A Python library that transfers PyTorch tensors between CPU and NVMe ☆103 Updated 2 months ago
- ☆18 Updated this week
- oneCCL Bindings for Pytorch* ☆88 Updated last month
- Home for OctoML PyTorch Profiler ☆107 Updated last year
- Issues related to MLPerf™ Inference policies, including rules and suggested changes ☆59 Updated last week
- Memory Optimizations for Deep Learning (ICML 2023) ☆62 Updated 11 months ago
- Bandwidth test for ROCm ☆54 Updated this week
- ☆67 Updated 3 months ago
- Framework to reduce autotune overhead to zero for well known deployments. ☆61 Updated 2 weeks ago
- High-speed GEMV kernels, at most 2.7x speedup compared to pytorch baseline. ☆97 Updated 7 months ago
- RCCL Performance Benchmark Tests ☆59 Updated 3 weeks ago
- Samples demonstrating how to use the Compute Sanitizer Tools and Public API ☆75 Updated last year
- Issues related to MLPerf™ training policies, including rules and suggested changes ☆94 Updated 2 months ago
- GPTQ inference TVM kernel ☆38 Updated 9 months ago
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions ☆24 Updated last week
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance. ☆86 Updated this week
- ☆42 Updated last month
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining ☆12 Updated last year
- benchmarking some transformer deployments ☆26 Updated last year
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆38 Updated 9 months ago
- No-GIL Python environment featuring NVIDIA Deep Learning libraries. ☆43 Updated this week
- A tracing JIT for PyTorch ☆17 Updated 2 years ago
- CUDA Templates for Linear Algebra Subroutines ☆14 Updated this week