OpenMLIR / TritonLLMLinks

LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model
36Updated last week

Alternatives and similar repositories for TritonLLM

Users that are interested in TritonLLM are comparing it to the libraries listed below

Sorting: