MEHDI342 / CUDAM
CUDA-to-Metal MPS translation (M1) (CUDA to Mac)
☆48 Updated 2 months ago
Alternatives and similar repositories for CUDAM
Users who are interested in CUDAM are comparing it to the libraries listed below.
- A python library to run metal compute kernels on macOS ☆84 Updated 8 months ago
- FlashAttention (Metal Port) ☆538 Updated last year
- Profile your CoreML models directly from Python 🐍 ☆28 Updated last month
- C API for MLX ☆137 Updated 2 weeks ago
- MLX support for the Open Neural Network Exchange (ONNX) ☆59 Updated last year
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices ☆144 Updated 2 years ago
- ☆54 Updated 2 years ago
- LLM training in simple, raw C/Metal Shading Language ☆58 Updated last year
- Inference of Mamba models in pure C ☆191 Updated last year
- Python bindings for ggml ☆146 Updated last year
- Run models on native runtimes ☆16 Updated last week
- Find out why your CoreML model isn't running on the Neural Engine! ☆26 Updated last year
- GitHub Action to install CUDA ☆191 Updated last week
- Port of Meta's Encodec in C/C++ ☆222 Updated 10 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX ☆22 Updated 11 months ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend. ☆122 Updated 3 weeks ago
- Simple high-throughput inference library ☆142 Updated 5 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates ☆189 Updated 3 weeks ago
- example of using CoreML from c++ ☆23 Updated 2 years ago
- LibTorch builds for the M1 Macs ☆54 Updated 2 months ago
- Emulating double-precision arithmetic on Apple GPUs ☆55 Updated 2 years ago
- 1.58 Bit LLM on Apple Silicon using MLX ☆224 Updated last year
- python bindings for symphonia/opus - read various audio formats from python and write opus files ☆68 Updated 2 months ago
- A simple, hackable text-to-speech system in PyTorch and MLX ☆175 Updated 2 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework ☆84 Updated last year
- LLM training in simple, raw C/CUDA ☆105 Updated last year
- ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64) ☆223 Updated last year
- Apple GPU microarchitecture ☆553 Updated last year
- mlx image models for Apple Silicon machines ☆85 Updated 6 months ago
- Use safetensors with ONNX 🤗 ☆69 Updated 2 weeks ago