kfish / micrograd-cpp-2023
A C++ port of karpathy/micrograd, a tiny scalar-valued autograd engine and a neural net library
☆14 Updated last year
Alternatives and similar repositories for micrograd-cpp-2023:
Users who are interested in micrograd-cpp-2023 are comparing it to the libraries listed below
- SYCL Reference Manual ☆27 Updated last year
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆50 Updated last month
- ☆23 Updated 3 years ago
- A framework that supports executing unmodified CUDA source code on non-NVIDIA devices. ☆121 Updated 3 months ago
- A minimal (really) out-of-tree MLIR example ☆44 Updated 2 weeks ago
- Tenstorrent MLIR compiler ☆120 Updated this week
- Generate simple index ranges in C++ and CUDA C++ ☆39 Updated last year
- Source code for 'Modern Parallel Programming with C++ and Assembly' by Dan Kusswurm ☆63 Updated 3 years ago
- MLIR-based toolkit targeting Intel heterogeneous hardware ☆40 Updated 2 months ago
- NVIDIA tools guide ☆129 Updated 3 months ago
- ☆18 Updated 2 weeks ago
- AMD’s C++ library for accelerating tensor primitives ☆39 Updated this week
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels ☆18 Updated last week
- CuPBoP-AMD is a CUDA translator that translates CUDA programs at NVVM IR level to HIP-compatible IR that can run on AMD GPUs. ☆36 Updated last year
- Teaching Vectorization and SIMD using Intel Intrinsics in a Computer Organization and Architecture class ☆14 Updated 2 months ago
- A translator from NVPTX to SPIR-V, modified from the LLVM-SPIR-V Translator. ☆38 Updated 3 years ago
- SYCL Benchmark Suite ☆64 Updated 2 months ago
- Examples from Programming in Parallel with CUDA ☆137 Updated 2 years ago
- ☆24 Updated this week
- The Farm-SVE package provides a header that implements the ARM C language extensions (ACLE) for the ARM Scalable Vector Extension (SVE) i… ☆14 Updated last year
- oneAPI Data Parallel C++ (DPC++) language reference ☆26 Updated 2 years ago
- TPP experimentation on MLIR for linear algebra ☆127 Updated this week
- Header-only safetensors loader and saver in C++ ☆56 Updated last week
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA ☆86 Updated this week
- ROCm Systems Profiler ☆17 Updated this week
- LLM training in simple, raw C/CUDA ☆92 Updated 11 months ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆93 Updated 3 years ago
- ☆14 Updated last year
- materials available to the public ☆25 Updated 5 months ago
- ☆56 Updated last month