NVIDIA / nvaitc-toolkit
An open-source code base that showcases the interoperability of the CUDA-X AI software stack in multi-GPU environments, giving researchers a reference framework on which to build new projects.
☆12 Updated last year
Related projects:
- A conda-smithy repository for nvcc. ☆12 Updated last month
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions ☆23 Updated last month
- Environment modules for NGC containers ☆28 Updated 2 years ago
- Fabric Manager packaging for Debian ☆13 Updated 3 years ago
- Reference CUDA implementation of training a small Bayesian neural network (BNN) using MCMC ☆15 Updated 3 years ago
- OpenMP for Python in Numba ☆66 Updated this week
- Responses to 2021 RFI on Stewardship of Software for Scientific and High-Performance Computing ☆16 Updated 2 years ago
- Benchmarking OpenBLAS on the Apple M1 ☆17 Updated 3 years ago
- Loop Nest - Linear algebra compiler and code generator. ☆22 Updated last year
- Worked example of the process from Python source to CUDA kernel execution with Numba ☆36 Updated last week
- This repository provides the Open-CE environment files and version definitions for each Open-CE release. ☆91 Updated last week
- The Exascale Computing Project Software Technologies Capability Assessment Report - Public Version ☆19 Updated 2 years ago
- Data Parallel Extension for NumPy ☆97 Updated this week
- ☆28 Updated this week
- NVIDIA Performance Libraries: Sample code ☆19 Updated 2 months ago
- NVIDIA curated collection of educational resources related to general purpose GPU programming. ☆58 Updated last month
- MLPerf™ Mobile models ☆24 Updated last month
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆15 Updated this week
- Python bindings for OpenSHMEM ☆13 Updated last month
- Performance engineering for the rest of us. ☆29 Updated last year
- CUDA Template Functions ☆18 Updated last month
- MGARD: MultiGrid Adaptive Reduction of Data ☆37 Updated 6 months ago
- SIMULATeQCD is a multi-GPU Lattice QCD framework that makes it easy for physicists to implement lattice QCD formulas while still providin… ☆30 Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆41 Updated 3 weeks ago
- Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex. ☆15 Updated 6 months ago
- An Aspiring Drop-In Replacement for Pandas at Scale ☆73 Updated 2 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆73 Updated 3 months ago
- Machine Learning for HPC Workflows ☆119 Updated last month
- A multi-platform experimentation framework written in Python. ☆38 Updated this week
- Apollo: Online Machine Learning for Performance Portability ☆22 Updated 3 weeks ago