thomaschlt / CUDApple

Exploration work on executing CUDA kernels on Apple Silicon (Metal-compatible code).

☆34

Alternatives and similar repositories for CUDApple

Users who are interested in CUDApple are comparing it to the libraries listed below.

Oxen-AI / GRPO-With-Cargo-Feedback
This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback
☆97 Updated 4 months ago
nreHieW / r-nn
Tensor library with autograd using only Rust's standard library
☆68 Updated last year
ljt019 / transformers
Transformers provides a simple, intuitive interface for Rust developers who want to work with Large Language Models locally, powered by t…
☆16 Updated this week
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆81 Updated 2 months ago
ml-explore / mlx-c
C API for MLX
☆117 Updated 2 months ago
ml-explore / mlx-onnx
MLX support for the Open Neural Network Exchange (ONNX)
☆53 Updated last year
google / minja
A minimalistic C++ Jinja templating engine for LLM chat templates
☆157 Updated last month
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101 Updated 4 months ago
kyutai-labs / moshi-swift
☆104 Updated 2 weeks ago
stockeh / mlx-optimizers
A collection of optimizers for MLX
☆36 Updated last month
facebookresearch / fastgen
Simple high-throughput inference library
☆118 Updated last month
kanpuriyanawab / picograd
Rust Implementation of micrograd
☆52 Updated last year
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆64 Updated 8 months ago
huggingface / kernel-builder
👷 Build compute kernels
☆74 Updated this week
FL33TW00D / deCoreML
Find out why your CoreML model isn't running on the Neural Engine!
☆25 Updated last year
exo-explore / mlx-bitnet
1.58 Bit LLM on Apple Silicon using MLX
☆214 Updated last year
PrimeIntellect-ai / pi-quant
SIMD quantization kernels
☆73 Updated this week
Goekdeniz-Guelmez / mlx-lm-lora
Train Large Language Models on MLX.
☆126 Updated this week
kyutai-labs / kaudio
Rust crate for some audio utilities
☆26 Updated 4 months ago
regrettable-username / llm.metal
LLM training in simple, raw C/Metal Shading Language
☆56 Updated last year
LucasSte / MLX-vs-Pytorch
Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs
☆87 Updated 11 months ago
Vaibhavs10 / fast-llm.rs
☆137 Updated last year
snowclipsed / moondream-zig
moondream in zig.
☆73 Updated last month
EndlessReform / fish-speech.rs
A Fish Speech implementation in Rust, with Candle.rs
☆92 Updated last month
RichardAragon / QGLS
☆32 Updated 4 months ago
EricLBuehler / zig_ml
Tensor library for Zig
☆11 Updated 7 months ago
atoma-network / atoma-infer
Fast serverless LLM inference, in Rust.
☆87 Updated 4 months ago
kelechi-c / mini_DiT
minimal diffusion transformer in pytorch.
☆16 Updated 9 months ago
PrimeIntellect-ai / pccl
PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP
☆96 Updated last month
Apsu / flue
Fast, Lightweight, Unified Engine for Text2Image Diffusion Models
☆20 Updated 2 months ago