☆43Dec 4, 2025Updated 3 months ago
Alternatives and similar repositories for vllm-openvino
Users who are interested in vllm-openvino are comparing it to the libraries listed below.
- OpenVINO Tokenizers extension☆49Mar 12, 2026Updated last week
- Edge Insights for Vision (eiv) is a package that helps to auto install Intel® GPU drivers and setup environment for Inference application…☆21Sep 29, 2025Updated 5 months ago
- DLL injection tool☆12Nov 9, 2020Updated 5 years ago
- PM Workshop China☆10Apr 11, 2019Updated 6 years ago
- Google DeepMind: Mixture of Depths Unofficial Implementation.☆12May 29, 2024Updated last year
- ☆21Jul 3, 2024Updated last year
- SPDK fork of nvme-cli. No longer supported - use standard nvme-cli with SPDK nvme CUSE instead. See https://spdk.io/doc/nvme.html#nvme_…☆15Apr 10, 2024Updated last year
- wirefisher: eBPF-powered traffic monitoring and control with precise per-process, IP, and port-level filtering, plus built-in rate limiti…☆38Dec 26, 2025Updated 2 months ago
- ☆15Mar 13, 2026Updated last week
- ☆19Jan 28, 2026Updated last month
- ☆17Mar 4, 2026Updated 2 weeks ago
- ☆15Jun 26, 2024Updated last year
- Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5☆16Sep 19, 2024Updated last year
- Unreal Engine 5 3D Platformer game prototype☆17May 27, 2024Updated last year
- This is a clone of an SVN repository at http://pagecache-mangagement.googlecode.com/svn/trunk. It had been cloned by http://svn2github.co…☆11May 23, 2013Updated 12 years ago
- PilotFish harvests the free GPU cycles of cloud gaming with deep learning training☆14Jul 2, 2022Updated 3 years ago
- ☆12Jan 7, 2023Updated 3 years ago
- Intel® AI for Enterprise Inference optimizes AI inference services on Intel hardware using Kubernetes Orchestration. It automates LLM mod…☆38Mar 11, 2026Updated last week
- Memory Address Tracer☆14Jun 12, 2020Updated 5 years ago
- SQL Server on OpenShift Workshop☆15Jun 27, 2023Updated 2 years ago
- Operator installing the Telemetry stack in a Kubernetes cluster and installing the metrics and alerts☆18Nov 3, 2023Updated 2 years ago
- ☆21Jan 12, 2026Updated 2 months ago
- ☆13Jan 16, 2026Updated 2 months ago
- MRRealmResultsController is an alternative to NSFetchedResultsController for use with realm-cocoa.☆12Apr 6, 2016Updated 9 years ago
- Add genai backend for ollama to run generative AI models using OpenVINO Runtime.☆23Jun 20, 2025Updated 9 months ago
- Luthier, a GPU binary instrumentation tool for AMD GPUs☆27Mar 13, 2026Updated last week
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆42May 13, 2025Updated 10 months ago
- ☆23Dec 5, 2025Updated 3 months ago
- RISC-V vector and tensor compute extensions for Vortex GPGPU acceleration for ML workloads. Optimized for transformer models, CNNs, and g…☆22Apr 25, 2025Updated 10 months ago
- ☆17Dec 27, 2024Updated last year
- ☆22Dec 27, 2022Updated 3 years ago
- ☆26Mar 14, 2024Updated 2 years ago
- Generate Linux Perf event tables for Apple Silicon☆17Dec 16, 2025Updated 3 months ago
- RISC-V-based many-core neuromorphic architecture☆16Aug 3, 2025Updated 7 months ago
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆25Aug 27, 2025Updated 6 months ago
- ☆31May 28, 2024Updated last year
- Pure Storage SQL Server script repository.☆33Jan 8, 2026Updated 2 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Compiler plugin for performance analysis of HIP applications☆13Apr 7, 2025Updated 11 months ago