opea-project / Enterprise-InferenceLinks
Intel® AI for Enterprise Inference optimizes AI inference services on Intel hardware using Kubernetes Orchestration. It automates LLM model deployment for faster inference, resource provisioning, and optimal settings to simplify processes and reduce manual work.
☆14Updated last week
Alternatives and similar repositories for Enterprise-Inference
Users that are interested in Enterprise-Inference are comparing it to the libraries listed below
Sorting:
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …☆48Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Updated this week
- Large Language Model Text Generation Inference on Habana Gaudi☆34Updated 3 months ago
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆185Updated 2 weeks ago
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data☆43Updated this week
- Starter template to develop agents for the beeai platform☆13Updated last week
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆108Updated 2 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆129Updated last week
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆174Updated 2 months ago
- For individual users, watsonx Code Assistant can access a local IBM Granite model☆34Updated 3 weeks ago
- Route LLM requests to the best model for the task at hand.☆81Updated 3 weeks ago
- ☆12Updated last year
- Setup and Installation Instructions for Habana binaries, docker image creation☆25Updated last month
- ☆62Updated last month
- GenAI Studio is a low code platform to enable users to construct, evaluate, and benchmark GenAI applications. The platform also provide c…☆45Updated 2 weeks ago
- PARIS (Perpetual Adaptive Regenerative Intelligence System) is a conceptual model for building and managing effective AI and Language Mod…☆24Updated 2 years ago
- An NVIDIA AI Workbench example project for exploring the RAPIDS cuDF library☆16Updated 2 months ago
- GenAI components at micro-service level; GenAI service composer to create mega-service☆163Updated this week
- How to build an ACP compliant agent that uses MCP as well!☆11Updated 2 months ago
- IBM development fork of https://github.com/huggingface/text-generation-inference☆61Updated 2 months ago
- Streamlit Web UI for AGiXT☆27Updated 3 weeks ago
- Samples, tutorials and other information about watsonx.data☆24Updated 2 weeks ago
- Evaluation, benchmark, and scorecard, targeting for performance on throughput and latency, accuracy on popular evaluation harness, safety…☆37Updated last week
- AI21 Python SDK☆64Updated 2 weeks ago
- HyDE based RAG using NVIDIA NIM.☆16Updated last year
- This repo contains documents of the OPEA project☆42Updated last week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆73Updated 2 weeks ago
- ☆18Updated 10 months ago
- Customizable, AI-driven virtual assistant designed to streamline customer service operations, handle common inquiries, and improve overal…☆151Updated 2 weeks ago
- ☆37Updated last week