Parsers for CUDA binary files
☆25Dec 29, 2023Updated 2 years ago
Alternatives and similar repositories for cudaparsers
Users that are interested in cudaparsers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆13Nov 23, 2024Updated last year
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- ☆26Feb 20, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Handwritten GEMM using Intel AMX (Advanced Matrix Extension)☆17Jan 11, 2025Updated last year
- SGLang Kernel Wheel Index☆23Updated this week
- A CUDA kernel for NHWC GroupNorm for PyTorch☆23Nov 15, 2024Updated last year
- A survey of manufacturer-provided DRAM operating parameters and timings as specified by DRAM chip datasheets from between 1970 and 2021. …☆11May 4, 2022Updated 4 years ago
- ☆11Apr 3, 2023Updated 3 years ago
- Dynamic suballocators for external memory (e.g., Vulkan device memory). Umaintained - consider migrating to https://crates.io/crates/offs…☆15Jul 22, 2022Updated 3 years ago
- ☆20Sep 28, 2024Updated last year
- ☆26Feb 17, 2025Updated last year
- Debug print operator for cudagraph debugging☆15Aug 2, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Benchmark tests supporting the TiledCUDA library.☆19Nov 19, 2024Updated last year
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- Parse objdump files using tree-sitter☆13Nov 22, 2023Updated 2 years ago
- Native Rust implementation of Kubernetes api☆32Mar 10, 2026Updated 3 months ago
- A tiny FP8 multiplication unit written in Verilog. TinyTapeout 2 submission.☆14Nov 23, 2022Updated 3 years ago
- Open Source SSD Controller. NVMe and Lightstor variants☆17May 21, 2014Updated 12 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆32Dec 21, 2024Updated last year
- ☆11Jun 9, 2023Updated 3 years ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆29Mar 22, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆66Mar 24, 2025Updated last year
- A practical way of learning Swizzle☆41Feb 3, 2025Updated last year
- ☆32Jun 6, 2024Updated 2 years ago
- Pure Rust implementation of the meshoptimizer library☆28Dec 18, 2025Updated 6 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆27Updated this week
- FBX DOM library for Rust. // See https://github.com/lo48576/fbx-viewer for working example application // NO PLAN TO UPDATE in the forese…☆28Mar 20, 2023Updated 3 years ago
- Pseudo-LRU implementation using 1-bit per entry and achieving Full-LRU performance.☆23Dec 17, 2022Updated 3 years ago
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.☆95Feb 23, 2023Updated 3 years ago
- 🧮 Polynomial Calculator☆12Jan 3, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- General Purpose Graphics Processing Unit (GPGPU) IP Core☆11Jul 4, 2014Updated 11 years ago
- ☆15Dec 16, 2021Updated 4 years ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆37May 30, 2026Updated 2 weeks ago
- Energy Consumption-Aware Tabular Benchmark For Neural Architecture Search☆11Aug 18, 2025Updated 10 months ago
- corundum work on vu13p☆23Nov 10, 2023Updated 2 years ago
- LFSC Proof Checker☆11Sep 14, 2023Updated 2 years ago
- Wait free synchronization primitives☆23May 19, 2026Updated 3 weeks ago