A universal scalable machine learning model deployment solution
☆248Mar 12, 2026Updated last week
Alternatives and similar repositories for djl-serving
Users that are interested in djl-serving are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Demo applications showcasing DJL☆352Mar 11, 2026Updated last week
- An Engine-Agnostic Deep Learning Framework in Java☆4,794Updated this week
- This open-source project delivers a complete pipeline for converting multi-page documents (PDFs/images) into structured JSON using Vision…☆15Aug 4, 2025Updated 7 months ago
- ☆45Aug 4, 2025Updated 7 months ago
- The Java implementation of Dive into Deep Learning (D2L.ai)☆191Mar 16, 2026Updated last week
- ☆110Jan 16, 2025Updated last year
- Large Language Model Hosting Container☆91Mar 11, 2026Updated last week
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h…☆142Oct 7, 2024Updated last year
- Example code for AWS Neuron SDK developers building inference and training applications☆158Mar 10, 2026Updated last week
- ☆11Jan 1, 2024Updated 2 years ago
- ☆133Updated this week
- One stop shop for running AI/ML on AWS.☆1,145Updated this week
- Training and inference on AWS Trainium and Inferentia chips.☆264Updated this week
- ☆270Updated this week
- ☆14Nov 1, 2024Updated last year
- Tools to measure latency for LLM in Amazon Bedrook☆22Jan 20, 2026Updated 2 months ago
- Deployment code for image generative AI and other related image based tasks.☆22May 15, 2023Updated 2 years ago
- ☆25Mar 6, 2026Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆25Mar 5, 2026Updated 2 weeks ago
- FM-Leaderboard-er allows you to create leaderboard to find the best LLM/prompt for your own business use case based on your data, task, p…☆19Oct 31, 2024Updated last year
- Foundation Model Evaluations Library☆278Aug 7, 2025Updated 7 months ago
- ☆32Updated this week
- ☆33Jun 11, 2025Updated 9 months ago
- Serve machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker.☆412Nov 20, 2023Updated 2 years ago
- A Java port of whisper 3, based on the huggingface version, using DJL.☆20Apr 3, 2024Updated last year
- ☆24Oct 18, 2023Updated 2 years ago
- TensorFlow Serving API of all programming language supported by protobuf and grpc.☆27Oct 16, 2020Updated 5 years ago
- ☆58Feb 5, 2026Updated last month
- ☆13Dec 19, 2025Updated 3 months ago
- AWS serverless Notifier - Serverless Application for easily integration of Dingtalk / Feishu / Slack / Telegram for Event Notification☆35Dec 9, 2025Updated 3 months ago
- A vision-language model with an improved cross-attention mechanism for scalable streaming inference☆28Mar 9, 2026Updated 2 weeks ago
- Serve, optimize and scale PyTorch models in production☆4,360Aug 6, 2025Updated 7 months ago
- Large Language Model Text Generation Inference☆10,812Jan 8, 2026Updated 2 months ago
- ☆13May 17, 2021Updated 4 years ago
- Unofficial implementation of DreamTalk in ComfyUI☆12Aug 15, 2024Updated last year
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- ☆14Dec 20, 2023Updated 2 years ago
- ☆81May 10, 2024Updated last year
- Multi Model Server is a tool for serving neural net models for inference☆1,025May 20, 2024Updated last year