nv-legate / legate.sparse
☆13 · Updated last month
Related projects:
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆73 · Updated 3 months ago
- The Foundation for All Legate Libraries ☆186 · Updated last week
- Material for the SC22 Deep Learning at Scale Tutorial ☆39 · Updated last year
- A Data-Centric Compiler for Machine Learning ☆81 · Updated 8 months ago
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/ ☆25 · Updated last week
- NVIDIA curated collection of educational resources related to general purpose GPU programming. ☆58 · Updated last month
- Analyze graph/hierarchical performance data using pandas dataframes ☆105 · Updated last month
- SParse AcceleRation on Tensor Architecture ☆17 · Updated 2 weeks ago
- An Aspiring Drop-In Replacement for Pandas at Scale ☆73 · Updated 2 years ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings. ☆15 · Updated this week
- Cosmic Tagging Network for Neutrino Physics ☆12 · Updated 2 months ago
- Reference implementations of MLPerf™ HPC training benchmarks ☆39 · Updated 3 months ago
- Data Parallel Extension for Numba ☆75 · Updated this week
- ☆64 · Updated 2 weeks ago
- Worked example of the process from Python source to CUDA kernel execution with Numba ☆36 · Updated last week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems. ☆41 · Updated 3 weeks ago
- An HPL-AI implementation for Fugaku ☆19 · Updated 3 years ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras ☆20 · Updated 7 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆21 · Updated last week
- ☆28 · Updated this week
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆27 · Updated 3 weeks ago
- MagmaDNN: a simple deep learning framework in c++ ☆45 · Updated 4 years ago
- A library that translates Python and NumPy to optimized distributed systems code. ☆131 · Updated 2 years ago
- Data Parallel Extension for NumPy ☆97 · Updated this week
- Intermediate MPI lesson ☆25 · Updated last year
- Material for the SC21 Deep Learning at Scale Tutorial ☆25 · Updated last year
- Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex. ☆15 · Updated 6 months ago
- NVIDIA Math Libraries for the Python Ecosystem ☆194 · Updated 2 months ago
- POC work on MLIR backend ☆46 · Updated 3 weeks ago
- Round matrix elements to lower precision in MATLAB ☆35 · Updated 2 years ago