Multi-V-VM / hetGPULinks
PTX on XPUs
☆58Updated last week
Alternatives and similar repositories for hetGPU
Users that are interested in hetGPU are comparing it to the libraries listed below
Sorting:
- PTX-EMU is a simple emulator for CUDA program.☆34Updated 5 months ago
- ☆175Updated last month
- Triton to TVM transpiler.☆22Updated 11 months ago
- Asynchronous semantics for architectural simulation and synthesis.☆49Updated last week
- A scheduling framework for multitasking over diverse XPUs, including GPUs, NPUs, ASICs, and FPGAs☆107Updated last week
- ☆85Updated 5 months ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated 11 months ago
- LLVM OpenCL C compiler suite for ventus GPGPU☆54Updated last week
- WaferLLM: Large Language Model Inference at Wafer Scale☆55Updated last week
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19Updated last year
- Yet another toy CPU.☆92Updated last year
- Unofficial description of the CUDA assembly (SASS) instruction sets.☆143Updated 2 months ago
- Canvas: End-to-End Kernel Architecture Search in Neural Networks☆27Updated 10 months ago
- Fast OS-level support for GPU checkpoint and restore☆236Updated last month
- Handwritten GEMM using Intel AMX (Advanced Matrix Extension)☆16Updated 8 months ago
- matmul using AMX instructions☆19Updated last year
- Being a full-stack hacker, RISCV, LLVM, and more.☆18Updated 3 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆100Updated 2 years ago
- An MLIR-based toy DL compiler for TVM Relay.☆59Updated 2 years ago
- An experimental CPU backend for Triton☆153Updated 3 months ago
- Virtuoso is a fast, accurate and versatile simulation framework designed for virtual memory research. Virtuoso uses a new simulation met…☆71Updated 4 months ago
- A Top-Down Profiler for GPU Applications☆20Updated last year
- ☆29Updated last year
- Advanced Matrix Extensions (AMX) Guide☆98Updated 3 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆27Updated 9 months ago
- My Paper Reading Lists and Notes.☆20Updated 8 months ago
- A flexible, high-performance, user-friendly computer architecture simulator engine☆89Updated this week
- ☆57Updated 3 months ago
- GPU Performance Advisor☆66Updated 3 years ago
- GVProf: A Value Profiler for GPU-based Clusters☆52Updated last year