alexarmbr/matmul-playground

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alexarmbr/matmul-playground)

alexarmbr / matmul-playground

☆29

Alternatives and similar repositories for matmul-playground

Users that are interested in matmul-playground are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

philipfabianek / ptx-playground
View on GitHub
A simple environment for writing and experimenting with hand-written CUDA PTX kernels.
☆18Sep 11, 2025Updated 10 months ago
fishmingyu / qrv2-gpu-mode
View on GitHub
Batched square compact-Householder QR factorization.
☆14Jul 2, 2026Updated 3 weeks ago
alibaba / redfuser
View on GitHub
☆21Mar 17, 2026Updated 4 months ago
seb-v / fp32_sgemm_amd
View on GitHub
Super fast FP32 matrix multiplication on RDNA3
☆92Mar 30, 2025Updated last year
nqdtan / vck5000_vivado_ulp
View on GitHub
An alternative Vivado custom design example (to fully Vitis) for the User Logic Partition targeting VCK5000
☆14Jul 16, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Better-Call-Paul / blackwell_gemm
View on GitHub
☆19Apr 26, 2026Updated 3 months ago
MarioLulab / Needle
View on GitHub
A basic deep learning library, comparable to a very minimal version of PyTorch.
☆19Mar 1, 2023Updated 3 years ago
PSCLab-ASU / Systolic-CNN
View on GitHub
☆18Feb 13, 2021Updated 5 years ago
foundation-model-stack / vllm-triton-backend
View on GitHub
A Triton-only attention backend for vLLM
☆27Jul 14, 2026Updated 2 weeks ago
khaki3 / ptxas-wrapper
View on GitHub
A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code
☆16Mar 19, 2023Updated 3 years ago
horizon-research / imagen
View on GitHub
☆10Mar 8, 2025Updated last year
ColfaxResearch / cutlass-kernels
View on GitHub
☆270Jul 11, 2024Updated 2 years ago
HazyResearch / ThunderMittens
View on GitHub
☆19Aug 26, 2025Updated 11 months ago
AIS-SNU / GraNNDis_Artifact
View on GitHub
[PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…
☆10Aug 13, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
leimao / CUTLASS-Examples
View on GitHub
CUTLASS and CuTe Examples
☆137Nov 30, 2025Updated 7 months ago
RotemBenHur / SIMPLER-MAGIC
View on GitHub
SIMPLER MAGIC: Synthesis and In-memory MaPping of Logic Execution in a single Row for Memristor Aided loGIC
☆13Dec 5, 2019Updated 6 years ago
ChijinZ / PolyJuice-Fuzzer
View on GitHub
A DL compiler fuzzer
☆15Nov 1, 2024Updated last year
theunnecessarythings / llm-ptx
View on GitHub
GPT2 in handwritten PTX
☆15Jun 29, 2025Updated last year
lemire / StarSchemaBenchmark
View on GitHub
O'Neil et al.'s Star Schema Benchmark: curated code
☆21May 19, 2025Updated last year
githwxi / XATSHOME
View on GitHub
For hosting ATS3 and developing CodeDepot
☆18Jun 14, 2026Updated last month
kilianhae / FlashAttention.C
View on GitHub
Flash Attention in raw Cuda C beating PyTorch
☆39May 14, 2024Updated 2 years ago
Aleph-Alpha / Alpha-MoE
View on GitHub
☆74Dec 10, 2025Updated 7 months ago
k4m4 / hex-cli
View on GitHub
Hex encode & decode a string, right from your terminal.
☆10Jan 5, 2023Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
simveit / effective_transpose
View on GitHub
Effective transpose on Hopper GPU
☆29Sep 6, 2025Updated 10 months ago
uiuc-arc / llm-code-watermark
View on GitHub
LLM Program Watermarking
☆19Apr 19, 2024Updated 2 years ago
CSA-infra / RISCV-Scalable-Simulation-tutorial
View on GitHub
☆15Feb 2, 2026Updated 5 months ago
junyuan-chen / LCPsolve.jl
View on GitHub
A solver for linear complementarity problems
☆12Dec 16, 2021Updated 4 years ago
compiler-disagg / TrackFM
View on GitHub
A compiler to automatically transform applications into disaggregated memory apps.
☆17Nov 16, 2023Updated 2 years ago
jrevels / MixedModeBroadcastAD.jl
View on GitHub
☆12May 23, 2018Updated 8 years ago
ezyang / cute-interactive
View on GitHub
Interactive version of the CuTe layout paper
☆57Apr 14, 2026Updated 3 months ago
coderlemon17 / LemonScripts
View on GitHub
Here is the repo for public scripts.
☆12Jul 16, 2022Updated 4 years ago
ColfaxResearch / cfx-article-src
View on GitHub
☆193May 7, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
estwings57 / HMC-MAC
View on GitHub
Processing-in Memory Architecture for Multiply-Accumulate Operations with Hybrid Memory Cube
☆12Feb 13, 2017Updated 9 years ago
arjundevraj / stragglar
View on GitHub
☆15Oct 2, 2025Updated 9 months ago
facebookexperimental / triton
View on GitHub
Github mirror of trition-lang/triton repo.
☆181Updated this week
whongzhong / MMHalSnowball
View on GitHub
Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20…
☆18Aug 12, 2024Updated last year
GATECH-EIC / torchshiftadd
View on GitHub
An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.
☆14Feb 3, 2025Updated last year
amckay / OptStab
View on GitHub
Replication material for "Optimal Automatic Stabilizers"
☆11Aug 9, 2021Updated 4 years ago
hatsu3 / Sanger
View on GitHub
☆48Aug 23, 2021Updated 4 years ago