VivekPanyam / cudaparsersLinks
Parsers for CUDA binary files
☆25Updated 2 years ago
Alternatives and similar repositories for cudaparsers
Users that are interested in cudaparsers are comparing it to the libraries listed below
Sorting:
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) APIs☆127Updated 2 weeks ago
- VectorVisor is a vectorizing binary translator for GPUs, designed to make it easy to run many copies of a single-threaded WebAssembly pro…☆158Updated last year
- Tenstorrent system interface library☆33Updated 3 weeks ago
- A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.☆21Updated 2 years ago
- A zero-copy serialization library and networking stack.☆50Updated last year
- ☆85Updated this week
- Rust bindings to the MLIR C API.☆69Updated last month
- Virtual machine for executing CUDA PTX without a GPU☆42Updated 2 years ago
- UB-aware interpreter for LLVM debugging☆31Updated 3 months ago
- An attempt at safe imperative GPU programming.☆61Updated 4 months ago
- A Rust library for safely programming persistent memory☆74Updated last year
- Benchmarking suite for Google workloads☆138Updated last week
- MLIR metal dialect☆35Updated last year
- Re-implementation of the TASO compiler using equality saturation☆138Updated 4 years ago
- Exploring the scalable matrix extension of the Apple M4 processor☆213Updated last year
- Flexible memory allocation tool for multi-tiered memory systems☆13Updated this week
- User-Mode Driver for Tenstorrent hardware☆36Updated this week
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19Updated last year
- Tools and experiments for 0sim. Simulate system software behavior on machines with terabytes of main memory from your desktop.☆21Updated 5 years ago
- The repo for HotOS paper "FIFO can be Better than LRU: the Power of Lazy Promotion and Quick Demotion"☆35Updated 2 years ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆27Updated last year
- MPIWasm is a WebAssembly Embedder based on Wasmer that enables the high-performance execution of MPI applications compiled to Wasm. (ACM …☆20Updated last year
- A case for representing data collections and objects in the LLVM IR☆21Updated 2 months ago
- An operation-log based approach for data replication.☆65Updated 2 years ago
- A lightweight memory allocator for hardware-accelerated machine learning☆179Updated 3 months ago
- A verified library of synchronization primitives and concurrent data structures☆40Updated 3 weeks ago
- Testing memory-level parallelism☆82Updated last year
- A description of Minotaur can be found in https://arxiv.org/abs/2306.00229.☆123Updated this week
- Embedded Universal DSL: a good DSL for us, by us☆60Updated this week
- Tutorial for LLVM Dev Conference 2019.☆15Updated 6 years ago