A tool to detect infrastructure issues on cloud native AI systems
☆53Sep 18, 2025Updated 6 months ago
Alternatives and similar repositories for autopilot
Users that are interested in autopilot are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AppWrapper controller for Kueue☆17Updated this week
- llm-d benchmark scripts and tooling☆54Apr 3, 2026Updated last week
- Cloud Native Benchmarking of Foundation Models☆45Jul 31, 2025Updated 8 months ago
- Failure dataset accompanying the paper "How Bad Can a Bug Get? An Empirical Analysis of Software Failures in the OpenStack Cloud Computi…☆10Jun 12, 2020Updated 5 years ago
- ☆15Jan 7, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Holistic job manager on Kubernetes☆116Feb 20, 2024Updated 2 years ago
- Predict the performance of LLM inference services☆23Sep 18, 2025Updated 6 months ago
- A hierarchical collective communications library with portable optimizations☆37Dec 8, 2024Updated last year
- Solution Service Architecture☆25Jun 5, 2024Updated last year
- Real-Time Intrusion Detection and Prevention with Neural Network in Kernel using eBPF☆24Apr 9, 2024Updated 2 years ago
- Red Hat Certified optional operator for secondary schedulers☆21Updated this week
- Project to manage Flux tasks needed to standardize kubernetes HPC scheduling interfaces☆28Jan 9, 2026Updated 3 months ago
- ☆15Updated this week
- Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)☆51Mar 17, 2026Updated 3 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Augmented Dickey-Fuller implementation in Go☆12Mar 15, 2019Updated 7 years ago
- Fast and efficient attention method exploration and implementation.☆25Mar 25, 2025Updated last year
- [DEPRECATED] Prometheus exporter for VPA recommendations☆12Aug 22, 2023Updated 2 years ago
- Snapped is a parallel program snapshotter designed for debugging deadlocks and crashes in programs. It acts as a wrapper around the GDB M…☆11Aug 26, 2024Updated last year
- Scripts for managing a large H100 cluster and fixing hardware issues to ensure smooth model training.☆323Aug 20, 2024Updated last year
- A suite of parallel file system tools designed for performance and scalability☆29May 14, 2024Updated last year
- ☆10Dec 10, 2024Updated last year
- Code and other materials for the S2I2 Software Summer School☆12Mar 11, 2017Updated 9 years ago
- A clean monorepo template for a Python project using uv☆13Jul 8, 2025Updated 9 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- The link to the website is at