SJTU-HPC / hpcbenchmarks
A toolset for evaluating the performance of high-performance computing systems
☆23Updated last year
Alternatives and similar repositories for hpcbenchmarks
Users that are interested in hpcbenchmarks are comparing it to the libraries listed below
- An HPC and Cloud Computing Fused Job Scheduling System☆118Updated last week
- Super Computing On Web☆303Updated this week
- Automated machine learning as an AI-HPC benchmark☆65Updated 3 years ago
- ☆277Updated 2 years ago
- ☆58Updated 5 years ago
- An efficient GPU resource sharing system with fine-grained control for Linux platforms.☆85Updated last year
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆132Updated 3 weeks ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆29Updated 6 months ago
- Sky Computing: Accelerating Geo-distributed Computing in Federated Learning☆91Updated 2 years ago
- Splits a single Nvidia GPU into multiple partitions with complete compute and memory isolation (with respect to performance) between the partitions☆159Updated 6 years ago
- A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod☆127Updated 3 years ago
- Metastack: an enhanced and performance optimized version of Slurm☆53Updated 3 months ago
- OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development…☆98Updated 4 years ago
- NVIDIA NCCL Tests for Distributed Training☆112Updated 2 weeks ago
- Bitfusion with Kubernetes Integration Support☆50Updated last year
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆86Updated last year
- ☆15Updated 2 months ago
- ☆28Updated last year
- High-performance RDMA-based distributed feature collection component for training GNN models on EXTREMELY large graphs☆55Updated 3 years ago
- SJTU HPC open-source project: Spackenv (Spack ENVironment) switches environments between sysadmins, users, and developers.☆22Updated 3 years ago
- Public repository for the BeeGFS Parallel File System☆165Updated 3 months ago
- qCUDA: GPGPU Virtualization at a New API Remoting Method with Para-virtualization☆129Updated 3 years ago
- A tool to detect infrastructure issues on cloud native AI systems☆47Updated 3 weeks ago
- The driver for LMCache core to run in vLLM☆52Updated 8 months ago
- An integrated user interface for use with HAI Platform☆53Updated 2 years ago
- Device plugin for Volcano vGPU that supports hard resource isolation☆107Updated 2 weeks ago
- Mirror of official Lustre development repository http://git.whamcloud.com/☆221Updated this week
- ☆31Updated 5 months ago
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆35Updated last year
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions☆32Updated last month