Michalos88 / Randomized_SVD_in_CUDALinks
FAST Randomized SVD on a GPU with CUDA 🏎️
☆12Updated 6 years ago
Alternatives and similar repositories for Randomized_SVD_in_CUDA
Users that are interested in Randomized_SVD_in_CUDA are comparing it to the libraries listed below
Sorting:
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆47Updated this week
- No-GIL Python environment featuring NVIDIA Deep Learning libraries.☆62Updated 3 months ago
- Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.☆16Updated last year
- ☆52Updated 11 months ago
- ☆13Updated 4 years ago
- ☆12Updated last week
- NPBench - A Benchmarking Suite for High-Performance NumPy☆85Updated 2 months ago
- Benchmarking PyTorch 2.0 different models☆21Updated 2 years ago
- ☆20Updated last year
- The CUDA target for Numba☆149Updated last week
- Xtructure is datastructure for using in JAX☆10Updated last week
- Analyze graph/hierarchical performance data using pandas dataframes☆116Updated 5 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆26Updated this week
- Benchmarking OpenBLAS on the Apple M1☆18Updated 4 years ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆115Updated 3 weeks ago
- A parallel framework for training deep neural networks☆62Updated 4 months ago
- LLM training in simple, raw C/CUDA☆99Updated last year
- Repository of machine learning benchmarks☆39Updated last week
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆109Updated last year
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions☆27Updated last week
- Benchmarks to capture important workloads.☆31Updated 5 months ago
- Data Parallel Extension for Numba☆82Updated 7 months ago
- cuASR: CUDA Algebra for Semirings☆36Updated 2 years ago
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆31Updated 3 months ago
- FFTX Project☆25Updated this week
- A tracing JIT compiler for PyTorch☆13Updated 3 years ago
- ☆28Updated 6 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆280Updated this week
- High dimensional black-box optimizer using Latent Action Monte Carlo Tree Search algorithm☆28Updated 2 years ago
- Loop Nest - Linear algebra compiler and code generator.☆22Updated 2 years ago