mangpo / swizzle-inventorView external linksLinks
A framework that helps implementing swizzle GPU kernels
☆51Feb 29, 2020Updated 5 years ago
Alternatives and similar repositories for swizzle-inventor
Users that are interested in swizzle-inventor are comparing it to the libraries listed below
Sorting:
- A retargetable and extensible synthesis-based compiler for modern hardware architectures☆17Nov 20, 2025Updated 2 months ago
- A tool for checking tool output inspired by LLVM's FileCheck☆12Aug 29, 2025Updated 5 months ago
- Iodine: Verifying Constant-Time Execution of Hardware☆15Mar 29, 2021Updated 4 years ago
- GPU model checker☆11Apr 17, 2019Updated 6 years ago
- bil verification tool☆12Jun 30, 2022Updated 3 years ago
- ☆16Mar 3, 2025Updated 11 months ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 2 years ago
- Intrusive data structures in Rust☆11May 26, 2015Updated 10 years ago
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆12Apr 17, 2023Updated 2 years ago
- Reticle evaluation (PLDI 2021)☆12Apr 12, 2021Updated 4 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Apr 6, 2021Updated 4 years ago
- A compiler synthesizer for simple languages.☆15Dec 18, 2018Updated 7 years ago
- MAFIA: Multiple Application Framework for GPU architectures☆28Jan 21, 2022Updated 4 years ago
- Public Release of Stream-Dataflow☆14May 17, 2019Updated 6 years ago
- Design space for LLVM/Clang work☆45Jun 14, 2012Updated 13 years ago
- CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark☆34Jun 24, 2025Updated 7 months ago
- ☆38Jul 19, 2025Updated 6 months ago
- A basic Docker-based installation of TVM☆11Jun 23, 2022Updated 3 years ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆14Nov 23, 2024Updated last year
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 6 months ago
- Numpy-like encrypted matrix arithmetic library based on OpenFHE☆28Feb 5, 2026Updated last week
- SOP level volumetric path tracer.☆13Mar 25, 2020Updated 5 years ago
- Re-implementation of the TASO compiler using equality saturation☆138Jun 28, 2021Updated 4 years ago
- Flexible GPGPU instrumentation☆89Oct 10, 2019Updated 6 years ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆133Jul 6, 2023Updated 2 years ago
- ☆15Mar 6, 2021Updated 4 years ago
- Benchmark PyTorch Custom Operators☆14Jul 6, 2023Updated 2 years ago
- ☆19Oct 14, 2018Updated 7 years ago
- Custom extensions to the RISC-V isa simulator for the UCB-BAR ESP project☆17Nov 27, 2022Updated 3 years ago
- Search-based compiler for high-performance DSP programming☆71Oct 29, 2024Updated last year
- SIMD recipes, for various platforms (collection of code snippets)☆49Jun 3, 2021Updated 4 years ago
- ☆16Apr 22, 2025Updated 9 months ago
- RDX implementation in Go☆37Oct 2, 2025Updated 4 months ago
- Reviving the old comp-arch.net wiki?☆18Jun 21, 2023Updated 2 years ago
- SafeInit protects software from uninitialized read vulnerabilities - code released for NDSS 2017☆26May 5, 2021Updated 4 years ago
- GPTQ inference TVM kernel☆40Apr 25, 2024Updated last year
- Liveness-driven random C code generator☆42Jul 30, 2025Updated 6 months ago
- Implementing SPMD control flow in LLVM using reconverging CFGs - Vectorizing Divergent Control-Flow for SIMD Applications☆18Apr 11, 2019Updated 6 years ago
- The Parrot stable and deterministic multi-threading system.☆25Nov 9, 2013Updated 12 years ago