awslabs / libfabric-ci-scripts
A place for all the various scripts utilized in the libfabric ci project such as pipelines and packer files.
☆11Updated last year
Alternatives and similar repositories for libfabric-ci-scripts:
Users that are interested in libfabric-ci-scripts are comparing it to the libraries listed below
- AWS Libfabric☆38Updated last week
- Rapid HPC Orchestration in the Cloud☆28Updated last year
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆42Updated last year
- Intel HPC Containers using Singularity☆19Updated 2 years ago
- ☆37Updated 9 months ago
- Tools for MPI programmers☆14Updated 4 years ago
- HPCToolkit performance tools: essential third party libraries for hpctoolkit☆8Updated 5 years ago
- Template scripts to setup Docker Images compatible with running on MNP Batch☆14Updated 5 years ago
- The System Stacks for Linux* OS are a collection of production ready docker images for Deep Learning, Media and Storage optimized for 2nd…☆34Updated 2 years ago
- aws-parallelcluster-node is the python package installed on the Amazon EC2 instances launched as part of AWS ParallelCluster☆65Updated this week
- Cray Lustre is HPE's curated Lustre distro for Cray EX and other Cray ClusterStor clients☆16Updated this week
- ☆12Updated 7 months ago
- RDMA core userspace libraries and daemons☆13Updated 2 months ago
- CAST can enhance the system management of cluster-wide resources. It consists of the open source tools: cluster system management (CSM) a…☆27Updated 2 years ago
- Deploy your HPC Cluster on AWS in 20min. with just 1-Click.☆63Updated last year
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆167Updated this week
- Data Accelerator: Creates a burst buffer from generic hardware and integrates it with Slurm https://www.hpc.cam.ac.uk/research/data-acc h…☆18Updated last year
- ☆11Updated 3 months ago
- Singularity Image Format (SIF) reference implementation.☆18Updated 2 weeks ago
- A multi-platform experimentation framework written in python.☆48Updated this week
- IO-500☆37Updated 4 years ago
- Pytorch process group third-party plugin for UCC☆20Updated 11 months ago
- Deploying EFA in EKS utilizing GPUDirectRDMA where supported☆37Updated 5 months ago
- ☆16Updated last year
- HPCPerfStats (formerly TACC Stats) is an automated resource-usage monitoring and analysis package.☆46Updated 2 weeks ago
- A collection of unit test to RDMA providers using libibverbs☆30Updated last week
- ☆23Updated 3 years ago
- The open source version of the AWS ParallelCluster User Guide.☆25Updated last year
- XALT: System tracking of users codes on clusters☆43Updated last month
- Apollo: Online Machine Learning for Performance Portability☆22Updated 7 months ago