An MLIR-based compiler that takes GPU kernels and compiles them to real hardware instructions. Interactive web visualizer included.
☆128Mar 21, 2026Updated last month
Alternatives and similar repositories for tiny-gpu-compiler
Users that are interested in tiny-gpu-compiler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of cryptographic and zero-knowledge algorithms implemented from scratch in Rust☆23Nov 21, 2024Updated last year
- 晚上下班不刷手机,学点什么。系列一:CUDA 计算框架 CUFX (Cuda Framework eXtended)。☆16Dec 15, 2024Updated last year
- An interactive web-based tool for exploring intermediate representations of PyTorch and Triton models☆49Jan 23, 2026Updated 3 months ago
- RPC request router and proxy for Starknet, forked from Optimism proxyd.☆12Feb 26, 2024Updated 2 years ago
- QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression.☆38Aug 29, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Triton Compiler related materials.☆44Mar 16, 2026Updated last month
- A demonstration of source code transformation to implement automatic differentiation, compatible with an operation overload style AD libr…☆14Jul 15, 2022Updated 3 years ago
- ☆18Jul 11, 2021Updated 4 years ago
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- Grokking on modular arithmetic in less than 150 epochs in MLX☆15Oct 24, 2024Updated last year
- An example application that uses SkiaSharp with Wpf☆15Apr 1, 2016Updated 10 years ago
- ☆11Sep 21, 2022Updated 3 years ago
- ☆14Jan 22, 2025Updated last year
- ☆14Jun 18, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆30Apr 21, 2026Updated last week
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Updated this week
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 7 months ago
- GEMV implementation with CUTLASS☆21Aug 21, 2025Updated 8 months ago
- A research-driven project focused on the Comparison of Multilinear Polynomial Commitment Schemes☆39Oct 17, 2025Updated 6 months ago
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- Sandgarden AI Software Factory☆116Apr 20, 2026Updated last week
- ☆123Sep 22, 2025Updated 7 months ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆14Nov 3, 2025Updated 5 months ago
- ☆13Jul 2, 2025Updated 9 months ago
- Fast, allocation-friendly .NET library to generate, parse, and manipulate ANSI/VT escape sequences (writer, markup, tokenizer, ANSI-aware…☆31Feb 14, 2026Updated 2 months ago
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zig☆14Apr 7, 2025Updated last year
- ☆17Apr 15, 2022Updated 4 years ago
- Tutorial Exercises and Code for GPU Communications Tutorial at HOT Interconnects 2025☆31Oct 22, 2025Updated 6 months ago
- The main codex repository☆26Feb 3, 2026Updated 2 months ago
- ☆15Mar 26, 2025Updated last year
- UK Mountain Weather App☆12Dec 29, 2014Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Jul 23, 2024Updated last year
- This project aims to replicate mainstream open-source model architectures with limited computational resources, implementing mini models …☆171Apr 19, 2026Updated last week
- Cute layout visualization☆37Jan 18, 2026Updated 3 months ago
- An experiment to do acoustic beamforming and beamsteering with Arduino.☆29Feb 22, 2023Updated 3 years ago
- Performance comparison between a CollectionView and DrawnUI for .NET MAUI☆19Nov 2, 2024Updated last year
- ☆63Dec 6, 2024Updated last year
- DoubleAI’s hyperoptimised version of cuGraph☆51Mar 3, 2026Updated last month