An MLIR-based compiler that takes GPU kernels and compiles them to real hardware instructions. Interactive web visualizer included.
☆127Mar 21, 2026Updated 2 weeks ago
Alternatives and similar repositories for tiny-gpu-compiler
Users that are interested in tiny-gpu-compiler are comparing it to the libraries listed below.
- A cutting-edge zkWASM implementation leveraging Nova-NIVC-based folding techniques.☆41Oct 28, 2025Updated 5 months ago
- A series of high-performance GEMM (General Matrix Multiply) implementations iteratively optimised for H100 GPUs in pure CUDA.☆76Feb 18, 2026Updated last month
- QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression.☆38Aug 29, 2025Updated 7 months ago
- Triton Compiler related materials.☆42Mar 16, 2026Updated 3 weeks ago
- CS341 for Spring 2024☆11Jul 15, 2024Updated last year
- teaching software 2.0 to programmers of software 1.0☆63Updated this week
- Test and benchmark your Rust library on mobile devices with ease.☆13Jul 17, 2023Updated 2 years ago
- ☆15Jan 16, 2024Updated 2 years ago
- A Zig implementation of Poseidon2 hash function.☆17Nov 11, 2025Updated 4 months ago
- Schnorr Signature algorithm using BLS12-381 Curve☆13Jan 10, 2024Updated 2 years ago
- GEMM☆10Aug 26, 2023Updated 2 years ago
- A demonstration of source code transformation to implement automatic differentiation, compatible with an operation overload style AD libr…☆14Jul 15, 2022Updated 3 years ago
- Build A Simple Web App With Sveltekit and Appwrite☆11Apr 3, 2023Updated 3 years ago
- .Net wrapper for the excellent libarchive project☆16Mar 24, 2026Updated 2 weeks ago
- Starknet Unity SDK lets game developers integrate Starknet blockchain functionality into their Unity projects with ease.☆13Jun 20, 2025Updated 9 months ago
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆25Aug 27, 2025Updated 7 months ago
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- Official website for the TRON (Token Reduced Object Notation) format☆38Nov 29, 2025Updated 4 months ago
- Simple sync/async event dispatcher for Rust☆17Dec 20, 2023Updated 2 years ago
- Bringing divine order to remote task execution.☆30Nov 25, 2024Updated last year
- ☆11Sep 21, 2022Updated 3 years ago
- ☆14Jan 22, 2025Updated last year
- Accelerated Zero-knowledge Virtual Machine by Non-uniform Prover Based on GKR Protocol☆140Updated this week
- Tensor library & inference framework for machine learning☆118Oct 3, 2025Updated 6 months ago
- Hand-Rolled GPU communications library☆89Nov 25, 2025Updated 4 months ago
- 🎓Automatically update circult-eda-mlsys-tinyml papers daily using GitHub Actions (updates every 8 hours)☆10Apr 3, 2026Updated last week
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 6 months ago
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 7 months ago
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- A research-driven project focused on the Comparison of Multilinear Polynomial Commitment Schemes☆38Oct 17, 2025Updated 5 months ago
- ☆121Sep 22, 2025Updated 6 months ago
- This project aims to replicate mainstream open-source model architectures with limited computational resources, implementing mini models …☆153Feb 10, 2026Updated 2 months ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- ☆14Nov 3, 2025Updated 5 months ago
- ☆13Jul 2, 2025Updated 9 months ago
- applications of https://github.com/PrefectHQ/marvin☆13Jan 15, 2024Updated 2 years ago
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zig☆14Apr 7, 2025Updated last year
- Region-level profiling for CUDA kernels with trace, NVBit, CUPTI, and an interactive Explorer.☆103Mar 27, 2026Updated last week
- ☆17Apr 15, 2022Updated 3 years ago