thomaschlt / CUDApple
Exploration work on executing CUDA kernels on Apple Silicon (Metal-compatible code).
☆34 Updated last month
Alternatives and similar repositories for CUDApple
Users who are interested in CUDApple are comparing it to the libraries listed below.
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback ☆97 Updated 4 months ago
- Tensor library with autograd using only Rust's standard library ☆68 Updated last year
- Transformers provides a simple, intuitive interface for Rust developers who want to work with Large Language Models locally, powered by t… ☆16 Updated this week
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆81 Updated 2 months ago
- C API for MLX ☆117 Updated 2 months ago
- MLX support for the Open Neural Network Exchange (ONNX) ☆53 Updated last year
- A minimalistic C++ Jinja templating engine for LLM chat templates ☆157 Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆101 Updated 4 months ago
- ☆104 Updated 2 weeks ago
- A collection of optimizers for MLX ☆36 Updated last month
- Simple high-throughput inference library ☆118 Updated last month
- Rust Implementation of micrograd ☆52 Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆64 Updated 8 months ago
- 👷 Build compute kernels ☆74 Updated this week
- Find out why your CoreML model isn't running on the Neural Engine! ☆25 Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX ☆214 Updated last year
- SIMD quantization kernels ☆73 Updated this week
- Train Large Language Models on MLX. ☆126 Updated this week
- Rust crate for some audio utilities ☆26 Updated 4 months ago
- LLM training in simple, raw C/Metal Shading Language ☆56 Updated last year
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs ☆87 Updated 11 months ago
- ☆137 Updated last year
- moondream in zig. ☆73 Updated last month
- A Fish Speech implementation in Rust, with Candle.rs ☆92 Updated last month
- ☆32 Updated 4 months ago
- Tensor library for Zig ☆11 Updated 7 months ago
- Fast serverless LLM inference, in Rust. ☆87 Updated 4 months ago
- minimal diffusion transformer in pytorch. ☆16 Updated 9 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP ☆96 Updated last month
- Fast, Lightweight, Unified Engine for Text2Image Diffusion Models ☆20 Updated 2 months ago