neurokernel / gpu-cluster-configLinks

How to Configure a GPU Cluster Running Ubuntu Linux

☆60

Alternatives and similar repositories for gpu-cluster-config

Users that are interested in gpu-cluster-config are comparing it to the libraries listed below

Sorting:

dholt / slurm-gpu
Scheduling GPU cluster workloads with Slurm
☆76Updated 7 years ago
mknoxnv / ubuntu-slurm
Steps to create a small slurm cluster with GPU enabled nodes
☆272Updated 2 years ago
nateGeorge / slurm_gpu_ubuntu
Instructions for setting up a SLURM cluster using Ubuntu 18.04.3 with GPUs.
☆153Updated last week
mlcommons / training_results_v0.5
This repository contains the results and code for the MLPerf™ Training v0.5 benchmark.
☆35Updated 6 months ago
HewlettPackard / dlcookbook-dlbs
Deep Learning Benchmarking Suite
☆130Updated 2 years ago
sylabs / examples
files and instructions for creating and using example containers from the sylabs.io blog
☆104Updated 2 years ago
hannes-brt / cudnn-python-wrappers
Python wrappers for the NVIDIA cuDNN libraries
☆142Updated 8 years ago
msalvaris / gpu_monitor
Monitor your GPUs whether they are on a single computer or in a cluster
☆162Updated 6 years ago
Microway / gpu-burn
Microway's improved version of GPU Burn
☆89Updated last year
PatWie / cluster-smi
nvidia-smi but for an entire GPU cluster
☆80Updated last year
jnbntz / gpu-edu-workshops
Code examples for CUDA and OpenACC
☆34Updated last year
SciDAS / slurm-in-docker
Slurm in Docker - Exploring Slurm using CentOS 7 based Docker images
☆129Updated 6 years ago
jonsafari / nvidia-ml-py
Bugfixing fork of Python bindings for the NVIDIA GPU Management Library
☆51Updated 8 years ago
fbcotter / py3nvml
Python 3 Bindings for NVML library. Get NVIDIA GPU status inside your program.
☆249Updated 3 years ago
mlcommons / training_results_v0.6
This repository contains the results and code for the MLPerf™ Training v0.6 benchmark.
☆42Updated 2 years ago
intel / ideep
Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.
☆171Updated this week
mlcommons / training_results_v0.7
This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.
☆57Updated 2 years ago
flame / fmm-gen
Generating Families of Practical Fast Matrix Multiplication Algorithms
☆12Updated 8 years ago
mcarilli / mixed_precision_references
Personal collection of references for high performance mixed precision training.
☆41Updated 6 years ago
OleHolmNielsen / Slurm_tools
My tools for the Slurm HPC workload manager
☆555Updated 2 months ago
matex-org / matex
Machine Learning Toolkit for Extreme Scale (MaTEx)
☆112Updated 7 years ago
NVIDIA / pyxis
Container plugin for Slurm Workload Manager
☆396Updated last week
adityaiitb / pyprof2
PyProf2: PyTorch Profiling tool
☆82Updated 5 years ago
NVIDIA / cnmem
A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory
☆299Updated 6 years ago
yaroslavvb / memory_util
TensorFlow util for building memory usage timeline from LOG_MEMORY messages
☆65Updated 7 years ago
NVIDIA / hpc-container-maker
HPC Container Maker
☆499Updated last month
tensorpack / benchmarks
Use TensorFlow efficiently
☆96Updated 4 years ago
loudinthecloud / dpwa
Distributed Learning by Pair-Wise Averaging
☆52Updated 8 years ago
hclhkbu / dlbench
Benchmarking State-of-the-Art Deep Learning Software Tools
☆171Updated 8 years ago
horovod / tutorials
Tutorials for Horovod
☆85Updated 4 years ago