kasper0406 / stablehlo-coremlLinks
Convert StableHLO models into Apple Core ML format
☆19Updated last month
Alternatives and similar repositories for stablehlo-coreml
Users that are interested in stablehlo-coreml are comparing it to the libraries listed below
Sorting:
- Backward compatible ML compute opset inspired by HLO/MHLO☆541Updated this week
- ☆330Updated 2 weeks ago
- ☆52Updated last year
- Stores documents and resources used by the OpenXLA developer community☆129Updated last year
- jax-triton contains integrations between JAX and OpenAI Triton☆423Updated 3 weeks ago
- MLIR-based partitioning system☆135Updated this week
- High-Performance SGEMM on CUDA devices☆101Updated 8 months ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆307Updated this week
- ☆176Updated last year
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆536Updated 3 weeks ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆299Updated last week
- Fast low-bit matmul kernels in Triton☆371Updated last week
- An experimental CPU backend for Triton☆153Updated 3 months ago
- Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.☆362Updated this week
- extensible collectives library in triton☆87Updated 5 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆576Updated last month
- ☆237Updated last week
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆123Updated last week
- Ahead of Time (AOT) Triton Math Library☆76Updated last week
- TORCH_LOGS parser for PT2☆60Updated this week
- Fastest kernels written from scratch☆346Updated last week
- Write a fast kernel and run it on Discord. See how you compare against the best!☆57Updated this week
- AMD RAD's experimental RMA library for Triton.☆74Updated this week
- Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard!☆96Updated last week
- JAX-Toolbox☆337Updated last week
- seqax = sequence modeling + JAX☆167Updated 2 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆142Updated last year
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆45Updated last month
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆353Updated this week
- ☆187Updated 3 weeks ago