VivekPanyam / cudaparsers
Parsers for CUDA binary files
☆22 · Updated last year
Alternatives and similar repositories for cudaparsers
Users who are interested in cudaparsers are comparing it to the libraries listed below.
- Heterogeneous Containerization of Large Language Model Apps ☆45 · Updated 2 weeks ago
- An educational implementation of a modern compressor in Rust ☆47 · Updated last year
- eRPC library for Rust ☆14 · Updated 5 years ago
- An attempt at safe imperative GPU programming. ☆45 · Updated last week
- Rust bindings to the MLIR C API. ☆65 · Updated 2 weeks ago
- Tools and experiments for 0sim. Simulate system software behavior on machines with terabytes of main memory from your desktop. ☆21 · Updated 5 years ago
- Loupe: Syscall Usage Analysis Tool ☆37 · Updated this week
- Virtual machine for executing CUDA PTX without a GPU ☆35 · Updated last year
- VectorVisor is a vectorizing binary translator for GPUs, designed to make it easy to run many copies of a single-threaded WebAssembly pro… ☆150 · Updated 9 months ago
- A description of Minotaur can be found in https://arxiv.org/abs/2306.00229. ☆108 · Updated 10 months ago
- ☆77 · Updated this week
- The repo for HotOS paper "FIFO can be Better than LRU: the Power of Lazy Promotion and Quick Demotion" ☆33 · Updated 2 years ago
- Fast WebAssembly Baseline Compiler ☆56 · Updated 2 years ago
- A parser for PTX 6.5 ☆11 · Updated 2 years ago
- A verified library of synchronization primitives and concurrent data structures ☆35 · Updated last month
- Embedded Universal DSL: a good DSL for us, by us ☆38 · Updated this week
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of … ☆26 · Updated 6 months ago
- MPIWasm is a WebAssembly Embedder based on Wasmer that enables the high-performance execution of MPI applications compiled to Wasm. (ACM … ☆19 · Updated last year
- Exploring the scalable matrix extension of the Apple M4 processor ☆180 · Updated 7 months ago
- Verification and optimization tool for concurrent code ☆25 · Updated 2 months ago
- GKLEE is a symbolic analyser and test generator tailored for CUDA C++ programs ☆38 · Updated 4 years ago
- ☆13 · Updated 4 years ago
- A pure, low-level tensor program representation enabling tensor program optimization via program rewriting. See the web demo at https://g… ☆70 · Updated 3 weeks ago
- A Rust RDMA library. ☆16 · Updated last week
- Tenstorrent system interface library ☆24 · Updated this week
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications. ☆25 · Updated 8 months ago
- Asynchronous Rust bindings for UCX ☆68 · Updated last month
- A zero-copy serialization library and networking stack. ☆47 · Updated last year
- 🦙🦙.🦀 ☆27 · Updated last year
- SquirrelFS: A crash-consistent Rust file system for persistent memory (OSDI 24) ☆60 · Updated last month