cake-lab / DELI
Optimizing the loading of training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for "Quantifying and Improving Performance of Distributed Deep Learning with Cloud Storage", to be published in IC2E 2021.
☆11 Updated 4 years ago
Alternatives and similar repositories for DELI
Users who are interested in DELI are comparing it to the libraries listed below:
- This is a repository for an I/O benchmark that represents scientific deep learning workloads. ☆23 Updated 3 years ago
- MLPerf™ logging library ☆38 Updated last month
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri… ☆21 Updated 3 years ago
- AMD HPC Research Fund Cloud ☆17 Updated 3 weeks ago
- Metastack: an enhanced and performance optimized version of Slurm ☆52 Updated this week
- Get started with your NVIDIA Arm HPC Developers Kit! ☆33 Updated 2 years ago
- ☆23 Updated last week
- SParse AcceleRation on Tensor Architecture ☆18 Updated 10 months ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier ☆18 Updated 7 months ago
- A tracing infrastructure for heterogeneous computing applications. ☆40 Updated last week
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs ☆15 Updated 5 years ago
- A task benchmark ☆44 Updated last year
- ☆53 Updated this week
- MAD (Model Automation and Dashboarding) ☆31 Updated this week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications. ☆204 Updated this week
- Benchmark implementation of CosmoFlow in TensorFlow Keras ☆22 Updated 2 years ago
- MANA for MPI ☆48 Updated 5 months ago
- VaniDL is a tool for analyzing I/O patterns and behavior of deep learning applications. ☆10 Updated 3 years ago
- An I/O benchmark for deep learning applications ☆102 Updated last month
- This repository contains the results and code for the MLPerf™ Training v2.1 benchmark. ☆15 Updated 2 years ago
- Python bindings for UCX ☆139 Updated 4 months ago
- Reference implementations of MLPerf™ HPC training benchmarks ☆49 Updated 11 months ago
- Benchmarks to capture important workloads. ☆32 Updated 2 weeks ago
- ☆55 Updated 2 months ago
- A library that translates Python and NumPy to optimized distributed systems code. ☆131 Updated 3 years ago
- Material for the SC22 Deep Learning at Scale Tutorial ☆41 Updated 2 years ago
- GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs ☆16 Updated 9 months ago
- Sparsity support for PyTorch ☆38 Updated 10 months ago
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o… ☆21 Updated 2 months ago
- Material for the SC21 Deep Learning at Scale Tutorial ☆27 Updated 2 years ago