sammysun0711 / ov_llm_benchLinks
OpenVINO LLM Benchmark
☆11Updated 2 years ago
Alternatives and similar repositories for ov_llm_bench
Users that are interested in ov_llm_bench are comparing it to the libraries listed below
Sorting:
- MAD (Model Automation and Dashboarding)☆30Updated last week
- High-Performance Linpack Benchmark adopted version for GPU backend☆12Updated 3 years ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆63Updated 5 months ago
- ☆24Updated 2 months ago
- Fast SGEMM emulation on Tensor Cores☆16Updated 10 months ago
- ☆22Updated last month
- oneCCL Bindings for Pytorch* (deprecated)☆104Updated last month
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆59Updated this week
- ☆67Updated last week
- oneAPI Level Zero Conformance & Performance test content☆59Updated last week
- MLPerf™ logging library☆37Updated last week
- ☆54Updated this week
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆41Updated last year
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 4 months ago
- Intel® Tensor Processing Primitives extension for Pytorch*☆17Updated last week
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆16Updated last year
- A recommendation model kernel optimizing system☆12Updated 6 months ago
- LLM-Inference-Bench☆56Updated 5 months ago
- Fast GPU based tensor core reductions☆13Updated 2 years ago
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆169Updated 3 months ago
- ☆11Updated 9 months ago
- Cosmic Tagging Network for Neutrino Physics☆13Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- Get started with your NVIDIA Arm HPC Developers Kit!☆33Updated 2 years ago
- ☆71Updated 9 months ago
- Reference implementations of MLPerf™ HPC training benchmarks☆49Updated 10 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆14Updated 3 months ago
- An HPL-AI implementation for Fugaku☆22Updated 4 years ago
- monorepo for rocm libraries☆216Updated this week
- Bandwidth test for ROCm☆72Updated 2 weeks ago