triton-inference-server / redis_cache
TRITONCACHE implementation of a Redis cache
☆ 13 · Updated last week
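For context, Triton selects and configures a response cache at server startup through repeated `--cache-config <cache>,<key>=<value>` flags; a Redis-backed cache like this one is typically pointed at a running Redis instance that way. A minimal sketch (the model-repository path, host, and port values are illustrative, and the Redis cache shared library is assumed to be installed under Triton's cache directory):

```shell
# Launch Triton with the Redis cache implementation enabled.
# Assumes the redis cache library has been built and placed under
# Triton's cache directory (paths and host/port are illustrative).
tritonserver \
  --model-repository=/models \
  --cache-config redis,host=localhost \
  --cache-config redis,port=6379
```

Per-model caching is then opted into in each model's configuration, so only models that enable the response cache will consult Redis.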
Alternatives and similar repositories for redis_cache:
Users interested in redis_cache are comparing it to the libraries listed below.
- WIP. Veloce is a low-code, Ray-based parallelization library for novel, efficient, and heterogeneous machine learning computation. ☆ 18 · Updated 2 years ago
- Simple dependency injection framework for Python. ☆ 20 · Updated 9 months ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable Diffusion, Whisper). ☆ 28 · Updated last year
- Creating generative AI apps that work. ☆ 16 · Updated 7 months ago
- Make Triton easier. ☆ 44 · Updated 8 months ago
- The backend behind the LLM-Perf Leaderboard. ☆ 10 · Updated 9 months ago
- A Ray-based data loader with per-epoch shuffling and configurable pipelining, for shuffling and loading training data for distributed tra… ☆ 18 · Updated 2 years ago
- vLLM adapter for a TGIS-compatible gRPC server. ☆ 21 · Updated this week
- ☆ 53 · Updated last month
- Some microbenchmarks and design docs before commencement. ☆ 12 · Updated 4 years ago
- Triton backend for managing the model state tensors automatically in the sequence batcher. ☆ 14 · Updated last year
- This repository contains statistics about AI infrastructure products. ☆ 18 · Updated 3 weeks ago
- MLflow deployment plugin for Ray Serve. ☆ 43 · Updated 2 years ago
- Core utilities for NVIDIA Merlin. ☆ 19 · Updated 6 months ago
- ☆ 37 · Updated 2 years ago
- IBM development fork of https://github.com/huggingface/text-generation-inference. ☆ 59 · Updated 2 months ago
- Sentence embedding as a service. ☆ 14 · Updated last year
- First-token-cutoff sampling inference example. ☆ 29 · Updated last year
- The driver for LMCache core to run in vLLM. ☆ 29 · Updated 2 weeks ago
- Benchmark suite for LLMs from Fireworks.ai. ☆ 66 · Updated last week
- Vector database with support for late interaction and token-level embeddings. ☆ 52 · Updated 4 months ago
- Open-sourced backend for Martian's LLM Inference Provider Leaderboard. ☆ 17 · Updated 6 months ago
- Code repository for the paper "AdANNS: A Framework for Adaptive Semantic Search". ☆ 62 · Updated last year
- Super-fast structured outputs. ☆ 114 · Updated this week
- ☆ 22 · Updated this week
- Lightning fast: Faiss CPU + ONNX quantized multilingual embedding model. ☆ 23 · Updated 5 months ago
- A pipeline for using API calls to agnostically convert unstructured data into structured training data. ☆ 29 · Updated 4 months ago
- Machine learning inference graph spec. ☆ 21 · Updated 5 years ago
- Self-host LLMs with vLLM and BentoML. ☆ 87 · Updated this week