cake-lab / DELILinks
Optimizing loading training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for Quantifying and Improving Performance of Distributed Deep Learning with Cloud Storage, to be published in IC2E 2021
☆12Updated 3 years ago
Alternatives and similar repositories for DELI
Users that are interested in DELI are comparing it to the libraries listed below
Sorting:
- AMD HPC Research Fund Cloud☆15Updated last month
- This is repository for a I/O benchmark which represents Scientific Deep Learning Workloads.☆23Updated 2 years ago
- MLPerf™ logging library☆37Updated this week
- ☆22Updated 2 weeks ago
- A tracing infrastructure for heterogeneous computing applications.☆35Updated this week
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆17Updated 2 months ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆21Updated 2 years ago
- ☆42Updated this week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆186Updated this week
- MANA for MPI☆42Updated 3 weeks ago
- COCCL: Compression and precision co-aware collective communication library☆23Updated 6 months ago
- Data and reproducibility scripts for the UoB-HPC Performance Portability studies☆17Updated last year
- Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.☆16Updated last year
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆61Updated 2 weeks ago
- This repository contains the results and code for the MLPerf™ Training v4.0 benchmark.☆12Updated last year
- SParse AcceleRation on Tensor Architecture☆17Updated 5 months ago
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆166Updated this week
- Benchmarks to capture important workloads.☆31Updated 7 months ago
- A library that translates Python and NumPy to optimized distributed systems code.☆132Updated 3 years ago
- Guides and examples to help achieve optimal performance on a NVIDIA Grace CPU☆15Updated last year
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆64Updated 5 months ago
- Reference implementations of MLPerf™ HPC training benchmarks☆49Updated 7 months ago
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions☆33Updated 2 weeks ago
- ☆57Updated this week
- Globus Compute: High Performance Function Serving for Science☆158Updated last week
- Metastack: an enhanced and performance optimized version of Slurm☆54Updated 2 months ago
- The CUDA target for Numba☆191Updated this week
- Python bindings for UCX☆140Updated last week
- ☆51Updated 3 months ago
- A task benchmark☆43Updated last year