sammysun0711 / ov_llm_benchLinks
OpenVINO LLM Benchmark
☆11Updated last year
Alternatives and similar repositories for ov_llm_bench
Users that are interested in ov_llm_bench are comparing it to the libraries listed below
Sorting:
- A CUTLASS implementation using SYCL☆27Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Updated last month
- OpenVINO Tokenizers extension☆36Updated last week
- OpenVINO backend for Triton.☆32Updated last week
- oneCCL Bindings for Pytorch*☆97Updated 2 months ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆61Updated last week
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆13Updated 6 months ago
- ☆38Updated this week
- ☆21Updated last month
- ☆62Updated 6 months ago
- Intel® Tensor Processing Primitives extension for Pytorch*☆17Updated last week
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆40Updated 11 months ago
- ☆20Updated 3 months ago
- ☆25Updated this week
- ☆46Updated this week
- Library for modelling performance costs of different Neural Network workloads on NPU devices☆34Updated last week
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated last month
- oneAPI Level Zero Conformance & Performance test content☆54Updated this week
- ☆19Updated this week
- Ongoing research training transformer models at scale☆23Updated 2 weeks ago
- PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolution…☆19Updated 3 weeks ago
- High-Performance Linpack Benchmark adopted version for GPU backend☆11Updated 2 years ago
- Large Language Model Text Generation Inference on Habana Gaudi☆33Updated 3 months ago
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi☆42Updated 4 months ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆22Updated 2 weeks ago
- ☆29Updated 4 months ago
- ☆47Updated 3 weeks ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆164Updated last month
- Ahead of Time (AOT) Triton Math Library☆66Updated last week