SJTU-HPC / hpcbenchmarksLinks
高性能计算系统性能评价工具集
☆20Updated last year
Alternatives and similar repositories for hpcbenchmarks
Users that are interested in hpcbenchmarks are comparing it to the libraries listed below
Sorting:
- An HPC and Cloud Computing Fused Job Scheduling System☆109Updated this week
- Intel AVX-512简介☆50Updated last year
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆54Updated 3 years ago
- Efficient Compute-Communication Overlap for Distributed LLM Inference☆22Updated 2 weeks ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆28Updated 3 months ago
- Super Computing On Web☆282Updated 2 weeks ago
- Metastack: an enhanced and performance optimized version of Slurm☆52Updated 2 weeks ago
- Some microbenchmarks and design docs before commencement☆12Updated 4 years ago
- Sky Computing: Accelerating Geo-distributed Computing in Federated Learning☆91Updated 2 years ago
- Automated machine learning as an AI-HPC benchmark☆66Updated 2 years ago
- Optimized primitives for collective multi-GPU communication☆23Updated last year
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆35Updated last year
- SJTU HPC 开源项目:Spackenv (Spack ENVironment) switch environments between sysadmin, users and developers.☆22Updated 3 years ago
- The driver for LMCache core to run in vLLM☆44Updated 5 months ago
- Magnum IO community repo☆95Updated 2 months ago
- ☆31Updated this week
- ☆58Updated 4 years ago
- Bitfusion with Kubernetes Integration Support☆50Updated last year
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆129Updated last week
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions☆27Updated last week
- NVIDIA GPUDirect Storage Driver☆260Updated 2 months ago
- Runtime Tracing Library for TensorFlow☆43Updated 6 years ago
- ☆44Updated 6 months ago
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.☆95Updated 2 months ago
- ☆43Updated last year
- A ServerSAN storage system designed for flash device☆113Updated 3 weeks ago
- A tool to detect infrastructure issues on cloud native AI systems☆42Updated last month
- An I/O benchmark for deep Learning applications☆88Updated 3 weeks ago
- A collection of reproducible inference engine benchmarks☆32Updated 2 months ago
- ☆37Updated 7 months ago