VivekPanyam / cudaparsersLinks
Parsers for CUDA binary files
☆25Updated last year
Alternatives and similar repositories for cudaparsers
Users that are interested in cudaparsers are comparing it to the libraries listed below
Sorting:
- Intel® Instrumentation and Tracing Technology (ITT) and Just-In-Time (JIT) APIs☆121Updated last week
- Tenstorrent system interface library☆31Updated last week
- A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.☆20Updated 2 years ago
- Virtual machine for executing CUDA PTX without a GPU☆38Updated last year
- VectorVisor is a vectorizing binary translator for GPUs, designed to make it easy to run many copies of a single-threaded WebAssembly pro…☆155Updated last year
- A description of Minotaur can be found in https://arxiv.org/abs/2306.00229.☆112Updated 3 weeks ago
- ☆83Updated this week
- SquirrelFS: A crash-consistent Rust file system for persistent memory (OSDI 24)☆63Updated 5 months ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated last year
- MLIR metal dialect☆31Updated last year
- Rust bindings to the MLIR C API.☆67Updated 3 weeks ago
- PTX on XPUs☆66Updated this week
- The repo for HotOS paper "FIFO can be Better than LRU: the Power of Lazy Promotion and Quick Demotion"☆34Updated 2 years ago
- NVidia sass disassembler/inline patcher☆29Updated this week
- Unit benchmarks of CUDA event APIs.☆17Updated last year
- Tools and experiments for 0sim. Simulate system software behavior on machines with terabytes of main memory from your desktop.☆21Updated 5 years ago
- Embedded Universal DSL: a good DSL for us, by us☆46Updated last week
- Resource Allocation for Dynamic Demands☆20Updated last year
- An attempt at safe imperative GPU programming.☆55Updated last month
- Exploring the scalable matrix extension of the Apple M4 processor☆206Updated 11 months ago
- A zero-copy serialization library and networking stack.☆48Updated last year
- A verified library of synchronization primitives and concurrent data structures☆38Updated last month
- ☆18Updated 4 months ago
- ☆11Updated 2 years ago
- A enumerator for MLIR, relying on the information given by IRDL.☆18Updated this week
- eRPC library for Rust☆14Updated 5 years ago
- Benchmarking suite for Google workloads☆130Updated 2 weeks ago
- Rex is a safe and usable kernel extension framework that allows loading and executing Rust kernel extension programs in the place of eBPF…☆91Updated 2 weeks ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19Updated last year
- Experimental kernel with built-in replication.☆160Updated 2 months ago