modelscope / dash-infer

DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
231Updated last week

Alternatives and similar repositories for dash-infer:

Users that are interested in dash-infer are comparing it to the libraries listed below