A framework that helps implementing swizzle GPU kernels
☆51Feb 29, 2020Updated 6 years ago
Alternatives and similar repositories for swizzle-inventor
Users that are interested in swizzle-inventor are comparing it to the libraries listed below
Sorting:
- A retargetable and extensible synthesis-based compiler for modern hardware architectures☆17Nov 20, 2025Updated 3 months ago
- A tool for checking tool output inspired by LLVM's FileCheck☆12Aug 29, 2025Updated 6 months ago
- Iodine: Verifying Constant-Time Execution of Hardware☆15Mar 29, 2021Updated 4 years ago
- A Coq framework to support structural design and proof of hardware cache-coherence protocols☆14May 7, 2022Updated 3 years ago
- bil verification tool☆12Jun 30, 2022Updated 3 years ago
- GPU model checker☆12Apr 17, 2019Updated 6 years ago
- Intrusive data structures in Rust☆11May 26, 2015Updated 10 years ago
- A compiler synthesizer for simple languages.☆15Dec 18, 2018Updated 7 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Apr 6, 2021Updated 4 years ago
- Utilities for paper writing.☆12Jan 11, 2026Updated last month
- You Only Search Once: On Lightweight Differentiable Architecture Search for Resource-Constrained Embedded Platforms☆12Apr 17, 2023Updated 2 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 2 years ago
- ☆16Mar 3, 2025Updated last year
- Reticle evaluation (PLDI 2021)☆12Apr 12, 2021Updated 4 years ago
- MAFIA: Multiple Application Framework for GPU architectures☆28Jan 21, 2022Updated 4 years ago
- Design space for LLVM/Clang work☆45Jun 14, 2012Updated 13 years ago
- Public Release of Stream-Dataflow☆14May 17, 2019Updated 6 years ago
- Public repository for the 2019 Parallel Functional Programming course at DIKU☆15Jan 13, 2020Updated 6 years ago
- CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark☆34Jun 24, 2025Updated 8 months ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 6 months ago
- 支持GPU全链路加速的全同态加密(FHE)框架☆20Apr 18, 2025Updated 10 months ago
- A basic Docker-based installation of TVM☆11Jun 23, 2022Updated 3 years ago
- SOP level volumetric path tracer.☆13Mar 25, 2020Updated 5 years ago
- Visualize TVM Relay program graph☆12Nov 19, 2019Updated 6 years ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆14Nov 23, 2024Updated last year
- Re-implementation of the TASO compiler using equality saturation☆138Jun 28, 2021Updated 4 years ago
- Flexible GPGPU instrumentation☆89Oct 10, 2019Updated 6 years ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆133Jul 6, 2023Updated 2 years ago
- ☆19Oct 14, 2018Updated 7 years ago
- ☆15Mar 6, 2021Updated 5 years ago
- Benchmark PyTorch Custom Operators☆14Jul 6, 2023Updated 2 years ago
- Custom extensions to the RISC-V isa simulator for the UCB-BAR ESP project☆17Nov 27, 2022Updated 3 years ago
- Numpy-like encrypted matrix arithmetic library based on OpenFHE☆30Updated this week
- Search-based compiler for high-performance DSP programming☆71Oct 29, 2024Updated last year
- SIMD recipes, for various platforms (collection of code snippets)☆49Jun 3, 2021Updated 4 years ago
- Reviving the old comp-arch.net wiki?☆18Jun 21, 2023Updated 2 years ago
- A C compiler with SSA-based backend optimzation☆15Mar 19, 2016Updated 9 years ago
- SafeInit protects software from uninitialized read vulnerabilities - code released for NDSS 2017☆26May 5, 2021Updated 4 years ago
- ☆16Apr 22, 2025Updated 10 months ago