triton-inference-server / redis_cache
TRITONCACHE implementation of a Redis cache
☆16 Updated this week
Alternatives and similar repositories for redis_cache
Users who are interested in redis_cache are comparing it to the libraries listed below.
- MLFlow Deployment Plugin for Ray Serve ☆46 Updated 3 years ago
- Module, Model, and Tensor Serialization/Deserialization ☆272 Updated 2 months ago
- xet client tech, used in huggingface_hub ☆313 Updated this week
- Triton backend for managing the model state tensors automatically in sequence batcher ☆18 Updated last year
- ☆41 Updated last week
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes. ☆110 Updated 2 weeks ago
- Core Utilities for NVIDIA Merlin ☆19 Updated last year
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup… ☆399 Updated this week
- ☆57 Updated 3 weeks ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous. ☆17 Updated 3 years ago
- A top-like tool for monitoring GPUs in a cluster ☆85 Updated last year
- ☆145 Updated this week
- Home for OctoML PyTorch Profiler ☆114 Updated 2 years ago
- Ray-based Apache Beam runner ☆41 Updated 2 years ago
- Unified storage framework for the entire machine learning lifecycle ☆155 Updated last year
- PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie… ☆31 Updated 2 years ago
- The Triton backend for the ONNX Runtime. ☆163 Updated this week
- FIL backend for the Triton Inference Server ☆83 Updated this week
- A collection of reproducible inference engine benchmarks ☆37 Updated 6 months ago
- ☆14 Updated 3 years ago
- The Triton backend for the PyTorch TorchScript models. ☆163 Updated last week
- Distributed XGBoost on Ray ☆152 Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆61 Updated last week
- TorchFix - a linter for PyTorch-using code with autofix support ☆148 Updated 2 months ago
- ☆12 Updated last year
- Merlin Systems provides tools for combining recommendation models with other elements of production recommender systems (like feature sto… ☆94 Updated last year
- First token cutoff sampling inference example ☆31 Updated last year
- Repository for open inference protocol specification ☆59 Updated 5 months ago
- Simple dependency injection framework for Python ☆21 Updated last year
- ☆16 Updated 2 months ago