triton-inference-server / fil_backend
FIL backend for the Triton Inference Server
☆72Updated this week
Related projects ⓘ
Alternatives and complementary repositories for fil_backend
- Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature sto…☆90Updated 5 months ago
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆185Updated 2 months ago
- The Triton backend for the ONNX Runtime.☆133Updated this week
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-…☆66Updated last year
- ☆51Updated last year
- Productionize machine learning predictions, with ONNX or without☆66Updated 10 months ago
- Distributed XGBoost on Ray☆144Updated 4 months ago
- RAPIDS GPU-BDB☆107Updated 8 months ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆18Updated 2 years ago
- MLFlow Deployment Plugin for Ray Serve☆42Updated 2 years ago
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆434Updated this week
- Incubating project for xgboost operator☆76Updated 2 years ago
- The Triton backend for the PyTorch TorchScript models.☆127Updated this week
- Plugin for deploying MLflow models to TorchServe☆106Updated last year
- The deepr module provide abstractions (layers, readers, prepro, metrics, config) to help build tensorflow models on top of tf estimators☆51Updated last year
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Updated last year
- Home for OctoML PyTorch Profiler☆107Updated last year
- Provide Python access to the NVML library for GPU diagnostics☆220Updated 3 months ago
- The Triton backend that allows running GPU-accelerated data pre-processing pipelines implemented in DALI's python API.☆125Updated 2 weeks ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆146Updated this week
- An Aspiring Drop-In Replacement for Pandas at Scale☆74Updated 3 years ago
- MLOps pipeline for NVIDIA Merlin on GKE☆41Updated 3 years ago
- MLPerf™ logging library☆30Updated this week
- scikit-learn_bench benchmarks various implementations of machine learning algorithms across data analytics frameworks. It currently suppo…☆113Updated last week
- The core library and APIs implementing the Triton Inference Server.☆105Updated this week
- Ray - A curated list of resources: https://github.com/ray-project/ray☆42Updated last year
- Distributed preprocessing and data loading for language datasets☆39Updated 7 months ago
- Dockerfile templates for creating RAPIDS Docker Images☆70Updated this week
- MLOps Python Library☆116Updated 2 years ago
- Python bindings for UCX☆121Updated this week