This is a landscape of the infrastructure that powers the generative AI ecosystem
☆155Oct 16, 2024Updated last year
Alternatives and similar repositories for ai-infra-landscape
Users that are interested in ai-infra-landscape are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains statistics about the AI Infrastructure products.☆17Feb 27, 2025Updated last year
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆76Jul 18, 2025Updated 8 months ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 6 years ago
- envd Website☆12Feb 7, 2026Updated 2 months ago
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others)☆282Nov 3, 2023Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A high-performance timeline tracing library for Golang, used by TiDB☆47Mar 10, 2026Updated last month
- GitHub Action for Continuous Profiling which you can run to profile your CI/CD. It uses parca and Polar Signals cloud.☆15Feb 10, 2026Updated 2 months ago
- ☆21Mar 20, 2026Updated 3 weeks ago
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆26Apr 24, 2025Updated 11 months ago
- NVIDIA device plugin for Kubernetes☆15Sep 9, 2019Updated 6 years ago
- 🏆 CNCF Community Awards☆26Mar 9, 2026Updated last month
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆31Mar 28, 2025Updated last year
- ☆19Apr 11, 2024Updated 2 years ago
- ☆13Jun 28, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Large language model fine-tuning capabilities based on cloud native and distributed computing.☆92Feb 22, 2024Updated 2 years ago
- 🏕️ Reproducible development environment for humans and agents☆2,192Updated this week
- OpenAI compatible API for open source LLMs☆17Oct 30, 2023Updated 2 years ago
- Canary release with helm (Deprecated since compass v2.8)☆13Sep 28, 2020Updated 5 years ago
- engula-operator creates/configures/manages engula clusters atop Kubernetes☆12Jan 5, 2022Updated 4 years ago
- Deploy ChatGLM on Modelz☆16Mar 20, 2023Updated 3 years ago
- ☆17Jul 18, 2025Updated 8 months ago
- ☆27Mar 31, 2026Updated last week
- LLM Serving Performance Evaluation Harness☆84Feb 25, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Cloud Native ML/DL Platform☆132Sep 9, 2020Updated 5 years ago
- A fast & easy way to train ML models in your cloud, directly from your laptop.☆14Mar 28, 2022Updated 4 years ago
- ☆31Jun 15, 2021Updated 4 years ago
- ☆10Jul 29, 2020Updated 5 years ago
- Container Object Storage Interface (COSI) provisioner responsible to interface with COSI drivers. NOTE: The content of this repo has bee…☆33Nov 26, 2024Updated last year
- ☆20Updated this week
- Application releases based on helm☆18Feb 5, 2021Updated 5 years ago
- 围绕云原生知识体系,收集一些不错的文章。仅供学习参考。☆53Jul 11, 2022Updated 3 years ago
- clustering algorithm implementation☆13Nov 3, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine☆897Apr 1, 2026Updated last week
- An awesome list of low- and no-code generative AI resources.☆53Jun 4, 2024Updated last year
- Vocabulary Parallelism☆25Mar 10, 2025Updated last year
- Test Orchestrator for Performance and Scalability of AI pLatforms☆16Updated this week
- A collection of AI solutions for oracle.ai☆18Apr 2, 2026Updated last week
- Container images for cloudnative-pg with the pgvecto.rs extension installed☆12Jan 17, 2024Updated 2 years ago
- ☆38Oct 16, 2025Updated 5 months ago