MEHDI342 / CUDAMLinks
CUDA-to-Metal MPS translation (M1) ( CUDA TO MAC)
☆57Updated 5 months ago
Alternatives and similar repositories for CUDAM
Users that are interested in CUDAM are comparing it to the libraries listed below
Sorting:
- FlashAttention (Metal Port)☆572Updated last year
- C API for MLX☆158Updated this week
- Profile your CoreML models directly from Python 🐍☆29Updated 4 months ago
- MLX support for the Open Neural Network Exchange (ONNX)☆63Updated last year
- LLM training in simple, raw C/Metal Shading Language☆60Updated last year
- Study and Implementations of Numerical Algorithms on Apple M1 and A* Devices☆149Updated 3 years ago
- mlx image models for Apple Silicon machines☆90Updated last month
- Find out why your CoreML model isn't running on the Neural Engine!☆30Updated last year
- ☆58Updated 2 years ago
- Simple high-throughput inference library☆155Updated 7 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆23Updated last year
- A minimalistic C++ Jinja templating engine for LLM chat templates☆202Updated 3 months ago
- Python bindings for ggml☆146Updated last year
- Emulating double-precision arithmetic on Apple GPUs☆58Updated 2 years ago
- A collection of optimizers for MLX☆54Updated last month
- A simple, hackable text-to-speech system in PyTorch and MLX☆184Updated 5 months ago
- Renderer for molecular nanotechnology☆88Updated last week
- 1.58 Bit LLM on Apple Silicon using MLX☆237Updated last year
- Port of Meta's Encodec in C/C++☆227Updated last year
- Apple GPU microarchitecture☆569Updated last year
- ☆18Updated last year
- Artificial Life simulations☆57Updated 10 months ago
- Benchmark of Apple MLX operations on all Apple Silicon chips (GPU, CPU) + MPS and CUDA.☆209Updated last week
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆72Updated 5 months ago
- Performance of PyTorch on Apple Silicon☆50Updated 2 years ago
- WebGPU LLM inference tuned by hand☆151Updated 2 years ago
- Run transformers (incl. LLMs) on the Apple Neural Engine.☆63Updated 2 years ago
- ☆24Updated 2 years ago
- ☆12Updated last year
- ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64)☆231Updated last year