VivekPanyam / cudaparsersLinks
Parsers for CUDA binary files
☆25Updated last year
Alternatives and similar repositories for cudaparsers
Users that are interested in cudaparsers are comparing it to the libraries listed below
Sorting:
- A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.☆20Updated 2 years ago
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) APIs☆121Updated 3 weeks ago
- VectorVisor is a vectorizing binary translator for GPUs, designed to make it easy to run many copies of a single-threaded WebAssembly pro…☆155Updated last year
- Rust bindings to the MLIR C API.☆68Updated last month
- ☆85Updated this week
- Virtual machine for executing CUDA PTX without a GPU☆39Updated last year
- Tenstorrent system interface library☆32Updated last week
- Tools and experiments for 0sim. Simulate system software behavior on machines with terabytes of main memory from your desktop.☆21Updated 5 years ago
- Re-implementation of the TASO compiler using equality saturation☆135Updated 4 years ago
- An attempt at safe imperative GPU programming.☆58Updated 2 months ago
- A description of Minotaur can be found in https://arxiv.org/abs/2306.00229.☆113Updated last month
- Exploring the scalable matrix extension of the Apple M4 processor☆208Updated 11 months ago
- Super fast FP32 matrix multiplication on RDNA3☆78Updated 7 months ago
- Embedded Universal DSL: a good DSL for us, by us☆54Updated this week
- MLIR metal dialect☆33Updated last year
- PTX on XPUs☆72Updated 2 weeks ago
- The missing pieces (as far as boilerplate reduction goes) of the upstream MLIR python bindings.☆111Updated 3 weeks ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19Updated last year
- Asynchronous Rust bindings for UCX☆77Updated 6 months ago
- Unit benchmarks of CUDA event APIs.☆17Updated last year
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆26Updated last year
- MPIWasm is a WebAssembly Embedder based on Wasmer that enables the high-performance execution of MPI applications compiled to Wasm. (ACM …☆20Updated last year
- An educational implementation of a modern compressor in Rust☆48Updated 2 years ago
- A pure, low-level tensor program representation enabling tensor program optimization via program rewriting. See the web demo at https://g…☆70Updated 5 months ago
- The repo for HotOS paper "FIFO can be Better than LRU: the Power of Lazy Promotion and Quick Demotion"☆34Updated 2 years ago
- An operation-log based approach for data replication.☆65Updated 2 years ago
- GPUOcelot: A dynamic compilation framework for PTX☆211Updated 8 months ago
- Fast WebAssembly Baseline Compiler☆60Updated 2 years ago
- Spectre V1 Proof-of-Concept Attack in the Rust Language☆25Updated 7 months ago
- A zero-copy serialization library and networking stack.☆49Updated last year