deepjavalibrary / djl-serving
A universal scalable machine learning model deployment solution
☆246 Updated this week
Alternatives and similar repositories for djl-serving
Users who are interested in djl-serving are comparing it to the libraries listed below.
- ☆111 Updated last year
- Training and inference on AWS Trainium and Inferentia chips. ☆259 Updated this week
- Example code for AWS Neuron SDK developers building inference and training applications ☆157 Updated 3 weeks ago
- ☆271 Updated 9 months ago
- Triton backend that enables pre-process, post-processing and other logic to be implemented in Python. ☆667 Updated this week
- Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i… ☆578 Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆25 Updated 3 months ago
- Examples on how to use LangChain and Ray ☆232 Updated 2 years ago
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h… ☆142 Updated last year
- ☆328 Updated this week
- Hands-on workshop for distributed training and hosting on SageMaker ☆151 Updated 3 months ago
- ☆413 Updated 2 years ago
- Serve machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker. ☆412 Updated 2 years ago
- ☆73 Updated last year
- ☆45 Updated 6 months ago
- Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala. ☆677 Updated this week
- Foundation Model Evaluations Library ☆276 Updated 6 months ago
- LLMPerf is a library for validating and benchmarking LLMs ☆1,084 Updated last year
- ☆64 Updated last month
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv… ☆504 Updated last week
- Common source, scripts and utilities for creating Triton backends. ☆366 Updated last week
- A helper library to connect into Amazon SageMaker with AWS Systems Manager and SSH (Secure Shell) ☆258 Updated 7 months ago
- ml-commons provides a set of common machine learning algorithms, e.g. k-means, or linear regression, to help developers build ML related … ☆143 Updated this week
- Large Language Model Hosting Container ☆91 Updated 4 months ago
- The Triton TensorRT-LLM Backend ☆918 Updated this week
- Neural search transforms text into vectors and facilitates vector search both at ingestion time and at search time. ☆110 Updated this week
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac… ☆254 Updated 9 months ago
- ☆25 Updated this week
- LLMPerf is a library for validating and benchmarking LLMs ☆11 Updated last year
- The Triton backend for the ONNX Runtime. ☆173 Updated this week