kfish / micrograd-cpp-2023
A C++ port of karpathy/micrograd, a tiny scalar-valued autograd engine and a neural net library
☆13Updated last year
Alternatives and similar repositories for micrograd-cpp-2023:
Users that are interested in micrograd-cpp-2023 are comparing it to the libraries listed below
- Retargetable ML compilers for the twenty-first century!☆12Updated last week
- MLIR-based toolkit targeting intel heterogeneous hardware☆38Updated last month
- ☆23Updated 3 years ago
- AMD’s C++ library for accelerating tensor primitives☆39Updated this week
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last week
- A minimal (really) out-of-tree MLIR example☆43Updated 3 weeks ago
- SYCL Reference Manual☆27Updated 11 months ago
- Source code for 'Modern Parallel Programming with C++ and Assembly' by Dan Kusswurm☆63Updated 2 years ago
- Website for CS 265☆28Updated 3 months ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta…☆45Updated 3 years ago
- ☆17Updated 10 months ago
- MLIR-based partitioning system☆76Updated this week
- GPU B-Tree with support for versioning (snapshots).☆47Updated 5 months ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆117Updated 2 months ago
- ☆56Updated last week
- A GLSL compiler targeting SPIR-V mlir☆19Updated 5 months ago
- ☆14Updated 11 months ago
- CuPBoP-AMD is a CUDA translator that translates CUDA programs at NVVM IR level to HIP-compatible IR that can run on AMD GPUs.☆36Updated last year
- Source for the OpenCilk runtime system, based on Cheetah.☆22Updated last week
- Efficient implementations of Merge Sort and Bitonic Sort algorithms using CUDA for GPU parallel processing, resulting in accelerated sort…☆13Updated last year
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆14Updated 3 years ago
- IREE C++ Template☆17Updated 8 months ago
- PTX-EMU is a simple emulator for CUDA program.☆30Updated last year
- MLIR metal dialect☆25Updated 6 months ago
- Header-only safetensors loader and saver in C++☆56Updated 3 weeks ago
- The Farm-SVE package provides a header that implements the ARM C language extensions (ACLE) for the ARM Scalable Vector Extension (SVE) i…☆14Updated last year
- Experiments and prototypes associated with IREE or MLIR☆50Updated 7 months ago
- Little OpenMP Library☆159Updated 2 years ago
- ☆44Updated this week