NVIDIA / mcmc-bnn-example
Reference CUDA implementation of training a small Bayesian neural network (BNN) using MCMC
☆17Updated 3 years ago
Alternatives and similar repositories for mcmc-bnn-example:
Users that are interested in mcmc-bnn-example are comparing it to the libraries listed below
- The Foundation for All Legate Libraries☆202Updated last month
- NVIDIA Math Libraries for the Python Ecosystem☆223Updated last month
- NPBench - A Benchmarking Suite for High-Performance NumPy☆76Updated 2 months ago
- Fast and full-featured Matrix Market I/O library for C++, Python, and R☆77Updated 5 months ago
- Example python (numpy) -- CUDA installable package with a C-extension library☆142Updated 5 years ago
- Exploring using stdpar and Cython☆33Updated 4 years ago
- Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python☆457Updated last week
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Updated 2 years ago
- Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels☆322Updated this week
- Data Parallel Extension for Numba☆79Updated 2 months ago
- An example combining scikit-build and pybind11☆117Updated this week
- OpenMP for Python in Numba☆90Updated 3 weeks ago
- Sparse matrix tools extending scipy.sparse, but with incompatible licenses☆163Updated 3 months ago
- Data Parallel Extension for NumPy☆101Updated this week
- CUDA kernel author's tools☆110Updated 2 years ago
- Extending JAX with custom C++ and CUDA code☆383Updated 5 months ago
- A nanobind example project☆97Updated this week
- A massively-parallel, block-sparse tensor framework written in C++☆267Updated last week
- N-Ways to GPU Programming Bootcamp☆64Updated 3 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆104Updated 2 weeks ago
- GPU Eigensolver for symmetric/hermitian matrices.☆63Updated 3 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆67Updated last year
- Analyze graph/hierarchical performance data using pandas dataframes☆109Updated 3 months ago
- DLA-Future☆69Updated this week
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆32Updated this week
- An Aspiring Drop-In Replacement for NumPy at Scale☆821Updated 3 weeks ago
- Template for GPU accelerated python libraries☆45Updated last year
- An Aspiring Drop-In Replacement for Pandas at Scale☆75Updated 3 years ago
- Python SYCL bindings and SYCL-based Python Array API library☆106Updated this week
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year