ROCm / tensorcastLinks
☆14Updated this week
Alternatives and similar repositories for tensorcast
Users that are interested in tensorcast are comparing it to the libraries listed below
Sorting:
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆310Updated 4 months ago
- IREE plugin repository for the AMD AIE accelerator☆112Updated this week
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.☆424Updated last month
- ☆109Updated last year
- A Winograd Minimal Filter Implementation in CUDA☆28Updated 4 years ago
- ☆163Updated 2 years ago
- Dissecting NVIDIA GPU Architecture☆109Updated 3 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆138Updated 2 years ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆111Updated 11 months ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆89Updated 2 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆115Updated 3 years ago
- TileFlow is a performance analysis tool based on Timeloop for fusion dataflows☆62Updated last year
- ☆112Updated last week
- PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity☆117Updated 2 weeks ago
- ☆157Updated this week
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆180Updated 3 years ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆119Updated 5 months ago
- Assembler for NVIDIA Volta and Turing GPUs☆231Updated 3 years ago
- ☆47Updated 4 years ago
- IREE's PyTorch Frontend, based on Torch Dynamo.☆99Updated last week
- DietCode Code Release☆65Updated 3 years ago
- ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference☆157Updated 8 months ago
- OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library☆25Updated 5 years ago
- ☆50Updated 6 years ago
- TVM for Tenstorrent ASICs☆27Updated last month
- ☆46Updated 4 months ago
- A home for the final text of all TVM RFCs.☆109Updated last year
- ☆39Updated 5 years ago
- OSDI 2023 Welder, deeplearning compiler☆27Updated last year
- Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)☆15Updated 4 years ago