toy-ai-top / TritonLLM

LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model
44 · Updated 3 weeks ago

Alternatives and similar repositories for TritonLLM

Users who are interested in TritonLLM are comparing it to the libraries listed below.
