NVIDIA / nim-anywhere
Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench
☆69Updated 2 weeks ago
Related projects: ⓘ
- End-to-End LLM Guide☆91Updated 2 months ago
- NVIDIA Ingest is a set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise docume…☆32Updated this week
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆120Updated last week
- NIM Agent Blueprint for multimodal PDF extraction☆29Updated 2 weeks ago
- An NVIDIA AI Workbench example project for fine-tuning a Mistral 7B model☆44Updated 3 months ago
- ☆49Updated this week
- Collection of reference workflows for building intelligent agents with NIMs☆75Updated this week
- ☆33Updated this week
- This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste…☆111Updated 6 months ago
- Including Hugging Face Deep learning Containers for Google Cloud☆106Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆87Updated this week
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆113Updated 7 months ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆194Updated last week
- Self-host LLMs with vLLM and BentoML☆62Updated this week
- The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PC…☆95Updated 3 weeks ago
- Large Language Model Hosting Container☆75Updated 2 weeks ago
- Infrastructure as code for GPU accelerated managed Kubernetes clusters.☆45Updated 4 months ago
- Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open…☆220Updated this week
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…☆231Updated last week
- ☆113Updated 9 months ago
- Triton CLI is an open source command line interface that enables users to create, deploy, and profile models served by the Triton Inferen…☆47Updated 2 weeks ago
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …☆23Updated this week
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆129Updated last month
- Tutorial for building LLM router☆145Updated 2 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆95Updated this week
- Deploy and Scale LLM-based applications☆26Updated last year
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆144Updated this week
- ☆15Updated 3 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆55Updated last month
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆115Updated last year