ray-project / llmperf
LLMPerf is a library for validating and benchmarking LLMs
☆749Updated 2 months ago
Alternatives and similar repositories for llmperf:
Users that are interested in llmperf are comparing it to the libraries listed below
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM☆977Updated this week
- The Triton TensorRT-LLM Backend☆779Updated this week
- A throughput-oriented high-performance serving framework for LLMs☆737Updated 4 months ago
- Serving multiple LoRA finetuned LLM as one☆1,028Updated 9 months ago
- ☆448Updated last year
- [NeurIPS'24 Spotlight, ICLR'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which r…