triton-inference-server / redis_cacheLinks
TRITONCACHE implementation of a Redis cache
☆16Updated 3 weeks ago
Alternatives and similar repositories for redis_cache
Users that are interested in redis_cache are comparing it to the libraries listed below
Sorting:
- Module, Model, and Tensor Serialization/Deserialization☆277Updated 3 months ago
- Triton backend for managing the model state tensors automatically in sequence batcher☆18Updated last year
- TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…☆402Updated last week
- The Triton backend for the ONNX Runtime.☆168Updated this week
- A minimal shared memory object store design☆54Updated 9 years ago
- Home for OctoML PyTorch Profiler☆114Updated 2 years ago
- ☆42Updated this week
- Kubernetes Operator, ansible playbooks, and production scripts for large-scale AIStore deployments on Kubernetes.☆113Updated last week
- MLFlow Deployment Plugin for Ray Serve☆46Updated 3 years ago
- ☆57Updated last week
- Simple dependency injection framework for Python☆21Updated last year
- Ray-based Apache Beam runner☆42Updated 2 years ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆17Updated 3 years ago
- The Triton backend for the PyTorch TorchScript models.☆166Updated last week
- Unified storage framework for the entire machine learning lifecycle☆155Updated last year
- ☆148Updated 3 weeks ago
- TorchFix - a linter for PyTorch-using code with autofix support☆151Updated 3 months ago
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆454Updated 3 weeks ago
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆103Updated this week
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra…☆18Updated 2 years ago
- benchmarking some transformer deployments☆26Updated last week
- 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.☆64Updated 10 months ago
- MLPerf™ logging library☆37Updated last month
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆161Updated 2 months ago
- Repository for open inference protocol specification☆60Updated 6 months ago
- Python bindings for UCX☆140Updated 2 months ago
- FIL backend for the Triton Inference Server☆83Updated this week
- Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.☆213Updated 7 months ago
- Benchmark suite for LLMs from Fireworks.ai☆84Updated last week
- A storage solution for PyTorch tensors with distributed tensor support.☆45Updated 2 weeks ago