MEHDI342 / CUDAMLinks
CUDA-to-Metal MPS translation (M1) ( CUDA TO MAC)
☆51Updated 3 months ago
Alternatives and similar repositories for CUDAM
Users that are interested in CUDAM are comparing it to the libraries listed below
Sorting:
- A python library to run metal compute kernels on macOS☆85Updated 10 months ago
- Profile your CoreML models directly from Python 🐍☆29Updated 2 months ago
- FlashAttention (Metal Port)☆557Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆22Updated last year
- C API for MLX☆153Updated last week
- MLX support for the Open Neural Network Exchange (ONNX)☆62Updated last year
- A minimalistic C++ Jinja templating engine for LLM chat templates☆198Updated 2 months ago
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆149Updated 2 years ago
- Find out why your CoreML model isn't running on the Neural Engine!☆27Updated last year
- LLM training in simple, raw C/Metal Shading Language☆60Updated last year
- example of using CoreML from c++☆24Updated 2 years ago
- mlx image models for Apple Silicon machines☆87Updated last month
- ☆12Updated last year
- Port of Meta's Encodec in C/C++☆226Updated 11 months ago
- ☆56Updated 2 years ago
- Apple GPU microarchitecture☆562Updated last year
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model.☆24Updated last month
- ☆23Updated 2 years ago
- Thin wrapper around GGML to make life easier☆40Updated 3 weeks ago
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆70Updated 4 months ago
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.☆201Updated 5 months ago
- Emulating double-precision arithmetic on Apple GPUs☆55Updated 2 years ago
- LibTorch builds for the M1 Macs☆54Updated 4 months ago
- A Python interface for the Dawn WebGPU engine☆14Updated last month
- A simple, hackable text-to-speech system in PyTorch and MLX☆183Updated 3 months ago
- Port of Suno AI's Bark in C/C++ for fast inference☆53Updated last year
- ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)☆228Updated last year
- Python bindings for ggml☆146Updated last year
- Simple high-throughput inference library☆150Updated 6 months ago
- Exploring the scalable matrix extension of the Apple M4 processor☆211Updated last year