opea-project / Enterprise-Inference
Intel® AI for Enterprise Inference optimizes AI inference services on Intel hardware using Kubernetes orchestration. It automates LLM deployment, resource provisioning, and configuration tuning, reducing manual work and speeding up inference.
☆30 · Updated this week
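The end product of this kind of automation is an ordinary Kubernetes Deployment serving an LLM endpoint. As a purely illustrative sketch (not Enterprise-Inference's actual tooling), the snippet below uses the official `kubernetes` Python client to create such a Deployment; the container image, model name, and the `habana.ai/gaudi` resource request for an Intel Gaudi node are assumptions, not taken from the project.

```python
# Illustrative only: hand-rolls one LLM-serving Deployment with the official
# kubernetes Python client. Image, model, and resource names are assumptions.
from kubernetes import client, config

config.load_kube_config()  # use the current kubeconfig context
apps = client.AppsV1Api()

container = client.V1Container(
    name="llm-server",
    image="vllm/vllm-openai:latest",                      # assumed serving image
    args=["--model", "meta-llama/Llama-3.1-8B-Instruct"],  # assumed model
    ports=[client.V1ContainerPort(container_port=8000)],
    resources=client.V1ResourceRequirements(
        limits={"habana.ai/gaudi": "1"},  # one Intel Gaudi accelerator (assumed)
    ),
)

deployment = client.V1Deployment(
    api_version="apps/v1",
    kind="Deployment",
    metadata=client.V1ObjectMeta(name="llm-server"),
    spec=client.V1DeploymentSpec(
        replicas=1,
        selector=client.V1LabelSelector(match_labels={"app": "llm-server"}),
        template=client.V1PodTemplateSpec(
            metadata=client.V1ObjectMeta(labels={"app": "llm-server"}),
            spec=client.V1PodSpec(containers=[container]),
        ),
    ),
)

apps.create_namespaced_deployment(namespace="default", body=deployment)
```

A deployment automation layer's value is generating and maintaining many such manifests (plus Services, autoscaling, and hardware-specific settings) so operators don't write them by hand.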
Alternatives and similar repositories for Enterprise-Inference
Users interested in Enterprise-Inference are comparing it to the repositories listed below.
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. ☆141 · Updated this week
- ☆17 · Updated 6 months ago
- Model Server for Kepler ☆29 · Updated 2 months ago
- ☆39 · Updated this week
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen… ☆216 · Updated last week
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management. ☆25 · Updated 3 weeks ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray ☆131 · Updated 2 months ago
- Containerization and cloud native suite for OPEA ☆72 · Updated 2 months ago
- Helm charts for llm-d ☆50 · Updated 4 months ago
- Carbon Limiting Auto Tuning for Kubernetes ☆37 · Updated last year
- A toolkit for discovering cluster network topology. ☆86 · Updated last week
- The NVIDIA GPU driver container allows the provisioning of the NVIDIA driver through the use of containers. ☆148 · Updated this week
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow … ☆56 · Updated this week
- WG Serving ☆32 · Updated last week
- ☆40 · Updated 2 weeks ago
- Cloud Native Benchmarking of Foundation Models ☆44 · Updated 4 months ago
- GenAI Studio is a low code platform to enable users to construct, evaluate, and benchmark GenAI applications. The platform also provide c… ☆55 · Updated 2 weeks ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench ☆193 · Updated 7 months ago
- Run cloud native workloads on NVIDIA GPUs ☆210 · Updated 2 months ago
- GenAI components at micro-service level; GenAI service composer to create mega-service ☆190 · Updated this week
- For individual users, watsonx Code Assistant can access a local IBM Granite model ☆37 · Updated 5 months ago
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing ☆30 · Updated last year
- Test Orchestrator for Performance and Scalability of AI pLatforms ☆16 · Updated this week
- This repo contains documents of the OPEA project ☆43 · Updated 3 months ago
- 🎉 An awesome & curated list of best LLMOps tools. ☆175 · Updated 2 weeks ago
- ☆20 · Updated this week
- GenAI inference performance benchmarking tool ☆137 · Updated 2 weeks ago
- A tool to detect infrastructure issues on cloud native AI systems ☆52 · Updated 3 months ago
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆119 · Updated this week
- Health checks for Azure N- and H-series VMs. ☆55 · Updated 2 weeks ago