sammysun0711 / ov_llm_bench
OpenVINO LLM Benchmark
☆11 Updated last year
Alternatives and similar repositories for ov_llm_bench:
Users who are interested in ov_llm_bench compare it to the libraries listed below.
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆13 Updated 2 months ago
- oneCCL Bindings for Pytorch* ☆95 Updated 2 weeks ago
- A CUTLASS implementation using SYCL ☆20 Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note… ☆62 Updated 2 months ago
- Intel® Tensor Processing Primitives extension for Pytorch* ☆15 Updated 2 weeks ago
- ☆30 Updated this week
- ☆20 Updated last week
- ☆422 Updated this week
- OpenVINO backend for Triton. ☆31 Updated this week
- Ongoing research training transformer models at scale ☆19 Updated last week
- OpenVINO Tokenizers extension ☆32 Updated 2 weeks ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training ☆13 Updated 4 months ago
- ☆20 Updated last month
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime ☆269 Updated this week
- Sample examples of how to call collective operation functions on multi-GPU environments. A simple example of using broadcast, reduce, all… ☆33 Updated last year
- PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolution… ☆19 Updated this week
- ☆60 Updated 4 months ago
- ☆47 Updated 3 weeks ago
- MLPerf™ logging library ☆36 Updated 2 weeks ago
- Tools for easier OpenVINO development/debugging ☆10 Updated last month
- Provides the examples to write and build Habana custom kernels using the HabanaTools ☆21 Updated 3 weeks ago
- ☆28 Updated 3 months ago
- ☆44 Updated this week
- Parallel selection on GPUs ☆16 Updated 4 years ago
- Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver ☆39 Updated last week
- High-Performance Linpack Benchmark adopted version for GPU backend ☆11 Updated 2 years ago
- oneAPI Collective Communications Library (oneCCL) ☆232 Updated last week
- ☆11 Updated last month
- ☆8 Updated 8 months ago
- Large Language Model Text Generation Inference on Habana Gaudi ☆33 Updated last month