rai-project / rai
The RAI client allows one to interact with a cluster of machine to submit and evaluate code. RAI is a scalable job submission system designed for diverse workloads. RAI’s design addresses challenges of scalability, configurability, security, and cost in delivering a flexible programming environments.
☆36Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for rai
- A tool for examining GPU scheduling behavior.☆70Updated 3 months ago
- Reference workloads for modern deep learning methods.☆73Updated last year
- 2019 Fall ECE408 Project Resources + Requirements☆77Updated 3 years ago
- Applied Parallel Programming UIUC FA 2017☆29Updated 6 years ago
- ☆30Updated last year
- ☆17Updated 4 years ago
- ParaDnn: A systematic performance analysis methodology for deep learning.☆39Updated 4 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆128Updated 4 years ago
- HCC Sample Applications☆13Updated 7 years ago
- First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.☆45Updated 6 years ago
- HPC Challenge Benchmark☆48Updated last year
- Tests and benchmarks for cudnn (and in the future, other nvidia libraries)☆53Updated 4 years ago
- ☆22Updated 5 years ago
- a multi-node fabric-attached memory manager that provides simple abstractions for accessing and allocating NVM from fabric-attached memor…☆9Updated 5 months ago
- IMPACT GPU Algorithms Teaching Labs☆55Updated last year
- A simple tool to profile performance of multiple combinations of GEMM of cuBLAS☆24Updated 3 years ago
- Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support…☆40Updated 6 years ago
- Magnum IO community repo☆79Updated 5 months ago
- ☆20Updated 2 years ago
- PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolution…☆18Updated last month
- CUPTI GPU Profiler☆37Updated 5 years ago
- ☆10Updated last year
- GVProf: A Value Profiler for GPU-based Clusters☆47Updated 7 months ago
- Some source code about matrix multiplication implementation on CUDA☆35Updated 6 years ago
- Experiments evaluating preemption on the NVIDIA Pascal architecture☆18Updated 8 years ago
- Infiniband verbs performance tests (fork of git://git.openfabrics.org/~grockah/perftest.git)☆16Updated 8 years ago
- GPUDirect Async support for IB Verbs☆90Updated 2 years ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆39Updated 2 years ago
- OpenSHMEM Reference Implementation over UCX for Specification 1.4 and up☆33Updated last year
- Automatic virtualization of (general) accelerators.☆40Updated last year