opea-project/GenAIComps

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/opea-project/GenAIComps)

opea-project / GenAIComps

GenAI components at micro-service level; GenAI service composer to create mega-service

☆198

Alternatives and similar repositories for GenAIComps

Users that are interested in GenAIComps are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

opea-project / GenAIEval
View on GitHub
Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety…
☆42Jul 6, 2026Updated 2 weeks ago
opea-project / GenAIExamples
View on GitHub
Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open…
☆736Updated this week
opea-project / GenAIInfra
View on GitHub
Containerization and cloud native suite for OPEA
☆74Jul 6, 2026Updated 2 weeks ago
huggingface / optimum-habana
View on GitHub
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
☆212Jul 6, 2026Updated 2 weeks ago
intel / document-automation
View on GitHub
Document Automation Reference Kit
☆16Jun 27, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
HabanaAI / vllm-fork
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆90Jul 13, 2026Updated last week
intel / neural-speed
View on GitHub
An innovative library for efficient LLM inference via low-bit quantization
☆352Aug 30, 2024Updated last year
huggingface / optimum-intel
View on GitHub
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
☆608Updated this week
intel / intel-extension-for-openxla
View on GitHub
☆61Mar 6, 2026Updated 4 months ago
intel / intel-extension-for-transformers
View on GitHub
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…
☆2,176Oct 8, 2024Updated last year
opea-project / Enterprise-Inference
View on GitHub
Intel® AI for Enterprise Inference optimizes AI inference services on Intel hardware using Kubernetes Orchestration. It automates LLM mod…
☆44Jul 8, 2026Updated 2 weeks ago
intel / intel-extension-for-pytorch
View on GitHub
A Python package for extending the official PyTorch that can easily obtain performance on Intel platform
☆2,014Mar 30, 2026Updated 3 months ago
HabanaAI / hccl_demo
View on GitHub
☆26Oct 9, 2025Updated 9 months ago
logan-markewich / bm25-rs
View on GitHub
Efficient BM25 indexing using rust
☆19Sep 17, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
intel / ai-containers
View on GitHub
This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …
☆79May 27, 2026Updated last month
intel / neural-compressor
View on GitHub
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …
☆2,684Updated this week
intel / credit-card-fraud-detection
View on GitHub
☆15Mar 3, 2025Updated last year
IntelLabs / RAG-FiT
View on GitHub
Framework for enhancing LLMs for RAG tasks using fine-tuning.
☆768Jun 8, 2026Updated last month
intel / torch-xpu-ops
View on GitHub
☆99Updated this week
openvinotoolkit / openvino.genai
View on GitHub
Run Generative AI models with simple C++/Python API and using OpenVINO Runtime
☆557Updated this week
pavanjava / mixture_of_workflows
View on GitHub
this is a repository that gives the power of mixture of workflows a concept inspired by the mixture of agents.
☆13Aug 19, 2024Updated last year
openvinotoolkit / openvino_testdrive
View on GitHub
With OpenVINO Test Drive, users can run large language models (LLMs) and models trained by Intel Geti on their devices, including AI PCs …
☆39Mar 12, 2026Updated 4 months ago
HabanaAI / DeepSpeed
View on GitHub
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆14Jan 8, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
intel / intel-xpu-backend-for-triton
View on GitHub
OpenAI Triton backend for Intel® GPUs
☆261Updated this week
sammysun0711 / ov_llm_bench
View on GitHub
OpenVINO LLM Benchmark
☆11Dec 7, 2023Updated 2 years ago
intel / enterprise-agent-toolkit
View on GitHub
A comprehensive, enterprise-ready toolkit for deploying Agentic AI systems on Intel® Xeon processors and Intel accelerators. Built for or…
☆15Jul 1, 2026Updated 3 weeks ago
intel / disease-prediction
View on GitHub
Multi-Modal Disease Prediction
☆15Jun 27, 2024Updated 2 years ago
intel / torch-ccl
View on GitHub
oneCCL Bindings for Pytorch* (deprecated)
☆104Dec 31, 2025Updated 6 months ago
intel / intel-xai-tools
View on GitHub
Explainable AI Tooling (XAI). XAI is used to discover and explain a model's prediction in a way that is interpretable to the user. Releva…
☆39Sep 22, 2025Updated 10 months ago
intel / ai
View on GitHub
Explore our open source AI portfolio! Develop, train, and deploy your AI solutions with performance- and productivity-optimized tools fro…
☆77Mar 27, 2026Updated 3 months ago
mlcommons / inference_results_v3.1
View on GitHub
This repository contains the results and code for the MLPerf™ Inference v3.1 benchmark.
☆11Jul 24, 2025Updated last year
intel / xetla
View on GitHub
☆61Dec 18, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
openvinotoolkit / openvino_build_deploy
View on GitHub
Pre-built components and code samples to help you build and deploy production-grade AI applications with the OpenVINO™ Toolkit from Intel
☆217Updated this week
vllm-project / vllm-xpu-kernels
View on GitHub
The vLLM XPU kernels for Intel GPU
☆55Updated this week
onnx / steering-committee
View on GitHub
Notes and artifacts from the ONNX steering committee
☆29Updated this week
intel / svr-info
View on GitHub
Intel® System Health Inspector (aka svr-info) is a Linux command line tool used to assess the health of Intel® Xeon® processor-based serv…
☆60Dec 17, 2024Updated last year
FalkorDB / FalkorDB-core-rs
View on GitHub
FalkorDB port to Rust
☆13Mar 26, 2026Updated 3 months ago
tradel / cc-kube-sockshop
View on GitHub
Full end-to-end demo of Consul in Kubernetes, including Connect service mesh and Ambassador L7 gateway
☆11Apr 26, 2019Updated 7 years ago
Python-Markdown / github-links
View on GitHub
Python-Markdown GitHub Links Extension
☆15Sep 4, 2025Updated 10 months ago