DeepLink-org / AIChipBenchmark
☆24Updated last month
Alternatives and similar repositories for AIChipBenchmark:
Users who are interested in AIChipBenchmark are comparing it to the libraries listed below.
- ☆140Updated 9 months ago
- ☆142Updated 3 weeks ago
- ☆127Updated last month
- ☆57Updated 2 months ago
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆219Updated 2 weeks ago
- ☆106Updated 10 months ago
- A hands-on tutorial on the core principles of TVM☆59Updated 4 years ago
- Tutorials for writing high-performance GPU operators in AI frameworks.☆127Updated last year
- FlagScale is a large model toolkit based on open-sourced projects.☆209Updated this week
- ☆94Updated 3 years ago
- NART = NART is not A RunTime, a deep learning inference framework.☆38Updated last year
- ☆19Updated 3 years ago
- ☆33Updated 3 months ago
- Compare different hardware platforms via the Roofline Model for LLM inference tasks.☆91Updated 10 months ago
- An unofficial CUDA assembler, for all generations of SASS, hopefully :)☆79Updated last year
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆88Updated 11 months ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆99Updated 4 months ago
- A demo of how to write a high-performance convolution that runs on Apple silicon☆52Updated 2 years ago
- Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.☆34Updated 4 months ago
- ☆79Updated 4 months ago
- LLM theoretical performance analysis tools that support parameter, FLOPs, memory, and latency analysis.☆76Updated 3 weeks ago
- The DeepSpark open platform selects hundreds of open source application algorithms and models that are deeply coupled with industrial app…☆42Updated last month
- Benchmark scripts for TVM☆73Updated 2 years ago
- FlagGems is an operator library for large language models implemented in Triton Language.☆407Updated this week
- ☆39Updated last week
- Transformer-related optimization, including BERT, GPT☆59Updated last year
- ☆23Updated last year
- Play with GEMM using TVM☆85Updated last year
- OneFlow->ONNX☆42Updated last year
- ☆19Updated 4 years ago