MEHDI342 / CUDAMLinks
CUDA-to-Metal MPS translation (M1) ( CUDA TO MAC)
☆52Updated 4 months ago
Alternatives and similar repositories for CUDAM
Users that are interested in CUDAM are comparing it to the libraries listed below
Sorting:
- FlashAttention (Metal Port)☆567Updated last year
- Profile your CoreML models directly from Python 🐍☆29Updated 3 months ago
- ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)☆229Updated last year
- C API for MLX☆155Updated this week
- LLM training in simple, raw C/Metal Shading Language☆60Updated last year
- Find out why your CoreML model isn't running on the Neural Engine!☆28Updated last year
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆149Updated 3 years ago
- MLX support for the Open Neural Network Exchange (ONNX)☆63Updated last year
- A simple, hackable text-to-speech system in PyTorch and MLX☆184Updated 4 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆227Updated last year
- A minimalistic C++ Jinja templating engine for LLM chat templates☆200Updated 2 months ago
- Apple GPU microarchitecture☆566Updated last year
- mlx image models for Apple Silicon machines☆88Updated 3 weeks ago
- Gpu benchmark☆73Updated 10 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆22Updated last year
- ☆57Updated 2 years ago
- ☆24Updated 2 years ago
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.☆206Updated 6 months ago
- Python bindings for ggml☆146Updated last year
- Simple high-throughput inference library☆152Updated 7 months ago
- 🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime☆95Updated 3 weeks ago
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆71Updated 4 months ago
- Exploring the scalable matrix extension of the Apple M4 processor☆213Updated last year
- Thin wrapper around GGML to make life easier☆40Updated last month
- LibTorch builds for the M1 Macs☆54Updated 4 months ago
- Emulating double-precision arithmetic on Apple GPUs☆56Updated 2 years ago
- TTS support with GGML☆201Updated 2 months ago
- GitHub Action to install CUDA☆196Updated last month
- Port of Meta's Encodec in C/C++☆227Updated last year
- WebGPU LLM inference tuned by hand☆151Updated 2 years ago