ModelTC / awesome-lm-system
Summary of system papers, frameworks, code, and tools for training or serving large models
☆57 Updated 2 years ago
Alternatives and similar repositories for awesome-lm-system
Users that are interested in awesome-lm-system are comparing it to the libraries listed below
- ☆85 Updated 9 months ago
- Odysseus: Playground of LLM Sequence Parallelism ☆79 Updated last year
- An easy-to-use package for implementing SmoothQuant for LLMs ☆110 Updated 9 months ago
- PyTorch bindings for CUTLASS grouped GEMM. ☆141 Updated 8 months ago
- GPTQ inference TVM kernel ☆41 Updated last year
- ☆117 Updated 8 months ago
- Quantized Attention on GPU ☆44 Updated last year
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.