cake-lab / DELILinks
Optimizing loading training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for Quantifying and Improving Performance of Distributed Deep Learning with Cloud Storage, to be published in IC2E 2021
☆11Updated 3 years ago
Alternatives and similar repositories for DELI
Users that are interested in DELI are comparing it to the libraries listed below
Sorting:
- AMD HPC Research Fund Cloud☆17Updated last month
- This is repository for a I/O benchmark which represents Scientific Deep Learning Workloads.☆23Updated 2 years ago
- Get started with your NVIDIA Arm HPC Developers Kit!☆33Updated 2 years ago
- MLPerf™ logging library☆37Updated last month
- ☆49Updated last week
- A tracing infrastructure for heterogeneous computing applications.☆38Updated 2 weeks ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 3 years ago
- COCCL: Compression and precision co-aware collective communication library☆28Updated 8 months ago
- A task benchmark☆45Updated last year
- Reference implementations of MLPerf™ HPC training benchmarks☆49Updated 9 months ago
- MANA for MPI☆45Updated 2 months ago
- ☆24Updated last month
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆18Updated 4 months ago
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Updated 4 years ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆34Updated last week
- An I/O benchmark for deep Learning applications☆94Updated 3 weeks ago
- MAD (Model Automation and Dashboarding)☆30Updated last week
- ☆22Updated last month
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆63Updated last month
- benchmarking some transformer deployments☆26Updated this week
- Python bindings for OpenSHMEM☆25Updated last month
- Python bindings for UCX☆140Updated 2 months ago
- Build tools for Open-CE☆13Updated 2 weeks ago
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆198Updated last week
- VaniDL is an tool for analyzing I/O patterns and behavior with Deep Learning Applications.☆10Updated 3 years ago
- A benchmark suite for measuring HDF5 performance.☆43Updated 3 months ago
- GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs☆16Updated 7 months ago
- Metastack: an enhanced and performance optimized version of Slurm☆53Updated this week
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆15Updated 11 months ago
- Tools to run and parse MKL verbose mode☆18Updated 3 years ago