Parsers for CUDA binary files
☆24Dec 29, 2023Updated 2 years ago
Alternatives and similar repositories for cudaparsers
Users who are interested in cudaparsers are comparing it to the libraries listed below.
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- ☆26Feb 20, 2024Updated 2 years ago
- Handwritten GEMM using Intel AMX (Advanced Matrix Extension)☆17Jan 11, 2025Updated last year
- SGLang Kernel Wheel Index☆18Updated this week
- [WIP] Better (FP8) attention for Hopper☆32Feb 24, 2025Updated last year
- A CUDA kernel for NHWC GroupNorm for PyTorch☆23Nov 15, 2024Updated last year
- A survey of manufacturer-provided DRAM operating parameters and timings as specified by DRAM chip datasheets from between 1970 and 2021. …☆11May 4, 2022Updated 3 years ago
- (WIP) Level up your shader game with the GPU + Rust advantage!☆14Mar 4, 2024Updated 2 years ago
- Dynamic suballocators for external memory (e.g., Vulkan device memory). Unmaintained - consider migrating to https://crates.io/crates/offs…☆15Jul 22, 2022Updated 3 years ago
- ☆26Feb 17, 2025Updated last year
- Quite OK image compression Verilog implementation☆23Nov 27, 2024Updated last year
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- Parse objdump files using tree-sitter☆13Nov 22, 2023Updated 2 years ago
- Native Rust implementation of Kubernetes api☆33Mar 10, 2026Updated 2 weeks ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆106Jun 28, 2025Updated 9 months ago
- A tiny FP8 multiplication unit written in Verilog. TinyTapeout 2 submission.☆14Nov 23, 2022Updated 3 years ago
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆19Nov 18, 2024Updated last year
- ☆126Jan 22, 2026Updated 2 months ago
- ☆15Jan 8, 2024Updated 2 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆30Dec 21, 2024Updated last year
- ☆11Jun 9, 2023Updated 2 years ago
- Framework to reduce autotune overhead to zero for well known deployments.☆98Sep 19, 2025Updated 6 months ago
- XML representation of the x86 instruction set☆29Feb 15, 2026Updated last month
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆30Mar 22, 2026Updated last week
- ☆34Feb 3, 2025Updated last year
- A Rust version of picosvg.☆14Oct 7, 2022Updated 3 years ago
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7)☆66Mar 24, 2025Updated last year
- A practical way of learning Swizzle☆37Feb 3, 2025Updated last year
- Take a QEMU binary, copy the dependencies into a chroot☆11Oct 5, 2022Updated 3 years ago
- ☆32Jun 6, 2024Updated last year
- Pure Rust implementation of the meshoptimizer library☆29Dec 18, 2025Updated 3 months ago
- FBX DOM library for Rust. // See https://github.com/lo48576/fbx-viewer for working example application // NO PLAN TO UPDATE in the forese…☆28Mar 20, 2023Updated 3 years ago
- 🎉My Collections of CUDA Kernels~☆11Jun 25, 2024Updated last year
- Pseudo-LRU implementation using 1-bit per entry and achieving Full-LRU performance.☆22Dec 17, 2022Updated 3 years ago
- 🧮 Polynomial Calculator☆12Jan 3, 2023Updated 3 years ago
- A library for reading MPQ archives.☆17Aug 14, 2024Updated last year