haozixu / htp-ops-libLinks
Self-implemented NN operators for Qualcomm's Hexagon NPU
☆29Updated last month
Alternatives and similar repositories for htp-ops-lib
Users that are interested in htp-ops-lib are comparing it to the libraries listed below
Sorting:
- 使用 cutlass 实现 flash-attention 精简版,具有教学意义☆51Updated last year
- A quantization algorithm for LLM☆146Updated last year
- Benchmark code for the "Online normalizer calculation for softmax" paper