LaurentMazare / gemm-metal
☆17 Updated last year
Alternatives and similar repositories for gemm-metal
Users who are interested in gemm-metal are comparing it to the libraries listed below.
- ☆19 Updated last year
- ☆12 Updated last year
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆124 Updated 2 months ago
- JAX bindings for the flash-attention3 kernels ☆16 Updated last month
- Rust crate for some audio utilities ☆25 Updated 8 months ago
- ☆53 Updated last year
- High-Performance SGEMM on CUDA devices ☆110 Updated 9 months ago
- Experimental GPU language with meta-programming ☆24 Updated last year
- C API for MLX ☆150 Updated last month
- Experimental compiler for deep learning models ☆70 Updated 2 months ago
- ☆21 Updated 8 months ago
- Gpu benchmark ☆72 Updated 9 months ago
- Sample Python extension using Rust/PyO3/tch to interact with PyTorch ☆38 Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆61 Updated last week
- Experimentation using the xla compiler from rust ☆98 Updated last year
- ☆26 Updated 7 months ago
- implement llava using candle ☆15 Updated last year
- train with kittens! ☆63 Updated last year
- Simple high-throughput inference library ☆149 Updated 6 months ago
- python package of rocm-smi-lib ☆24 Updated 4 months ago
- Profile your CoreML models directly from Python 🐍 ☆29 Updated 2 months ago
- Attention in SRAM on Tenstorrent Grayskull ☆38 Updated last year
- Experiment of using Tangent to autodiff triton ☆79 Updated last year
- A Learning Journey: Micrograd in Mojo 🔥 ☆63 Updated last year
- A stand-alone implementation of several NumPy dtype extensions used in machine learning. ☆306 Updated 2 weeks ago
- asynchronous/distributed speculative evaluation for llama3 ☆38 Updated last year
- LLM training in simple, raw C/Metal Shading Language ☆60 Updated last year
- Fast and Furious AMD Kernels ☆110 Updated this week
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code ☆10 Updated 2 years ago
- Thin wrapper around GGML to make life easier ☆40 Updated 2 weeks ago