A universal scalable machine learning model deployment solution
☆253May 1, 2026Updated this week
Alternatives and similar repositories for djl-serving
Users that are interested in djl-serving are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Demo applications showcasing DJL☆351Apr 11, 2026Updated 3 weeks ago
- The Java implementation of Dive into Deep Learning (D2L.ai)☆191Updated this week
- An Engine-Agnostic Deep Learning Framework in Java☆4,806Updated this week
- ☆46Aug 4, 2025Updated 8 months ago
- ☆112Jan 16, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Large Language Model Hosting Container☆92Apr 13, 2026Updated 2 weeks ago
- Example code for AWS Neuron SDK developers building inference and training applications☆158Apr 2, 2026Updated last month
- ☆11Jan 1, 2024Updated 2 years ago
- One stop shop for running AI/ML on AWS.☆1,152Updated this week
- Utility which provides a UI to do prompt engineering within SageMaker Studio.☆14Jul 5, 2023Updated 2 years ago
- Training and inference on AWS Trainium and Inferentia chips.☆267Apr 16, 2026Updated 2 weeks ago
- ☆271Apr 7, 2026Updated 3 weeks ago
- ☆13Nov 1, 2024Updated last year
- Deployment code for image generative AI and other related image based tasks.☆22May 15, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Hands-on workshop for distributed training and hosting on SageMaker☆153Nov 4, 2025Updated 5 months ago
- ☆22Updated this week
- ☆24Apr 24, 2026Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆25Mar 5, 2026Updated last month
- FM-Leaderboard-er allows you to create leaderboard to find the best LLM/prompt for your own business use case based on your data, task, p…☆19Oct 31, 2024Updated last year
- ☆34Apr 21, 2026Updated last week
- Foundation Model Evaluations Library☆284Aug 7, 2025Updated 8 months ago
- Serve machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker.☆412Nov 20, 2023Updated 2 years ago
- ☆41Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 提供产品级IOCR自定义模板识别,以图搜图,人像搜索等,免费,可商用,Java AI 人工智能一站式解决方案,为工作减负,为产品研发加速。项 目类别包括:以及Java版 Pytorch 训练引擎,AI SDK,web应用等。☆977Apr 25, 2026Updated last week
- Python 3.6+ module to make Flask compatible with AWS Lambda☆10Jul 5, 2023Updated 2 years ago
- Apache Spark based framework for analysis A/B experiments☆15Nov 3, 2024Updated last year
- AWS serverless Notifier - Serverless Application for easily integration of Dingtalk / Feishu / Slack / Telegram for Event Notification☆35Dec 9, 2025Updated 4 months ago
- A vision-language model with an improved cross-attention mechanism for scalable streaming inference☆29Mar 9, 2026Updated last month
- Large Language Model Text Generation Inference☆10,848Mar 21, 2026Updated last month
- ☆43Jan 29, 2026Updated 3 months ago
- Multi-Agent AI Coding Assistant - 支持 DeepSeek/文心/通义 等12+国产大模型,多智能体协作编程☆89Updated this week
- ☆13May 17, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- Deploy open-source LLMs on AWS in minutes — with OpenAI-compatible APIs and a powerful CLI/SDK toolkit.☆74Aug 26, 2025Updated 8 months ago
- ☆14Dec 20, 2023Updated 2 years ago
- Multi Model Server is a tool for serving neural net models for inference☆1,026May 20, 2024Updated last year
- ☆15Mar 15, 2021Updated 5 years ago
- This Guidance demonstrates how to streamline access to numerous large language models (LLMs) through a unified, industry-standard API gat…☆217Apr 24, 2026Updated last week
- Hypermodern Python Cookiecutter☆22Apr 24, 2026Updated last week