This is a landscape of the infrastructure that powers the generative AI ecosystem
☆157Oct 16, 2024Updated last year
Alternatives and similar repositories for ai-infra-landscape
Users that are interested in ai-infra-landscape are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains statistics about the AI Infrastructure products.☆17Feb 27, 2025Updated last year
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆79Apr 14, 2026Updated 2 months ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆12Apr 1, 2020Updated 6 years ago
- envd Website☆12Feb 7, 2026Updated 4 months ago
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)☆283Nov 3, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A high-performance timeline tracing library for Golang, used by TiDB☆47Jun 26, 2026Updated last week
- ☆21May 5, 2026Updated last month
- AI-based search done right☆20Dec 25, 2025Updated 6 months ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆27Apr 24, 2025Updated last year
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- 🏆 CNCF Community Awards☆27Mar 9, 2026Updated 3 months ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆31Mar 28, 2025Updated last year
- Holistic job manager on Kubernetes☆117Feb 20, 2024Updated 2 years ago
- Benchmark results from code generation with LLMs☆17Sep 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆19Apr 11, 2024Updated 2 years ago
- Feasibility research for using kind to support e2e tests for kubebuilder(v2)-generated Kubernetes operators.☆13Jul 1, 2019Updated 7 years ago
- ☆13Jun 28, 2024Updated 2 years ago
- ☆10Apr 15, 2026Updated 2 months ago
- Large language model fine-tuning capabilities based on cloud native and distributed computing.☆92Feb 22, 2024Updated 2 years ago
- 🏕️ Reproducible development environment for humans and agents☆2,211May 21, 2026Updated last month
- OpenAI compatible API for open source LLMs☆17Oct 30, 2023Updated 2 years ago
- Canary release with helm (Deprecated since compass v2.8)☆13Sep 28, 2020Updated 5 years ago
- Deploy ChatGLM on Modelz☆16Mar 20, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆20Apr 12, 2026Updated 2 months ago
- LLM Serving Performance Evaluation Harness☆84Feb 25, 2025Updated last year
- Cloud Native ML/DL Platform☆132Sep 9, 2020Updated 5 years ago
- ☆30Jun 15, 2021Updated 5 years ago
- Batch-scheduler based on K8s scheduling framework, related features have contributed to scheduler-plugins(Deprecated).☆26Aug 6, 2020Updated 5 years ago
- ☆10Jul 29, 2020Updated 5 years ago
- Container Object Storage Interface (COSI) provisioner responsible to interface with COSI drivers. NOTE: The content of this repo has bee…☆32Nov 26, 2024Updated last year
- ☆20Jun 27, 2026Updated last week
- 中国开发者活动日程(关注点:开源、开发者、云原生)☆25Jun 26, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Application releases based on helm☆18Feb 5, 2021Updated 5 years ago
- ☆32Updated this week
- 围绕云原生知识体系,收集一些不错的文章。仅供学习参考。☆52Jul 11, 2022Updated 3 years ago
- init to record my learning path of AI Infra, especially on inference.☆221Jun 22, 2026Updated last week
- clustering algorithm implementation☆13May 13, 2026Updated last month
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine☆902Jun 25, 2026Updated last week
- Vocabulary Parallelism☆26Mar 10, 2025Updated last year