FlagOpen / FlagPerfLinks

FlagPerf is an open-source software platform for benchmarking AI chips.

☆352

Alternatives and similar repositories for FlagPerf

Users that are interested in FlagPerf are comparing it to the libraries listed below

Sorting:

FlagOpen / FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
☆364Updated this week
DeepLink-org / deeplink.framework
☆70Updated 11 months ago
antgroup / glake
GLake: optimizing GPU memory management and IO transmission.
☆483Updated 7 months ago
Tencent / KsanaLLM
☆508Updated last month
alibaba / rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
☆903Updated this week
zhihu / ZhiLight
A highly optimized LLM inference acceleration engine for Llama and its variants.
☆902Updated 3 months ago
OpenPPL / ppl.llm.serving
☆129Updated 10 months ago
FlagOpen / FlagGems
FlagGems is an operator library for large language models implemented in the Triton Language.
☆703Updated this week
FlagOpen / FlagCX
☆91Updated last week
bytedance / ByteMLPerf
AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…
☆265Updated 2 months ago
DeepLink-org / DIOPI
☆75Updated 11 months ago
intel / xFasterTransformer
☆430Updated last month
Qcompiler / MIXQ
MIXQ: Taming Dynamic Outliers in Mixed-Precision Quantization by Online Prediction
☆93Updated 11 months ago
intelligent-machine-learning / dlrover
DLRover: An Automatic Distributed Deep Learning System
☆1,571Updated last week
bytedance / ABQ-LLM
An acceleration library that supports arbitrary bit-width combinatorial quantization operations
☆236Updated last year
LLMServe / DistServe
Disaggregated serving system for Large Language Models (LLMs).
☆709Updated 6 months ago
modelscope / dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …
☆266Updated 2 months ago
bytedance / ByteTransformer
optimized BERT transformer inference on NVIDIA GPU. https://arxiv.org/abs/2210.03052
☆479Updated last year
aliyun / SimAI
☆701Updated last month
OpenPPL / ppl.nn.llm
☆139Updated last year
alibaba / EasyParallelLibrary
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
☆269Updated 2 years ago
AlibabaPAI / llumnix
Efficient and easy multi-instance LLM serving
☆502Updated last month
bytedance / flux
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
☆1,153Updated last month
OpenPPL / ppl.llm.kernel.cuda
☆150Updated 9 months ago
Ascend / pytorch
Ascend PyTorch adapter (torch_npu). Mirror of https://gitee.com/ascend/pytorch
☆444Updated last month
feifeibear / LLMRoofline
Compare different hardware platforms via the Roofline Model for LLM inference tasks.
☆116Updated last year
DeepLink-org / dlinfer
☆64Updated this week
sgl-project / sgl-learning-materials
Materials for learning SGLang
☆618Updated 3 weeks ago
alibaba / BladeDISC
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
☆901Updated 9 months ago
volcengine / veScale
A PyTorch Native LLM Training Framework
☆875Updated last month