A holistic framework to enable the design, development, and evaluation of autonomous AIOps agents.
☆884May 19, 2026Updated last week
Alternatives and similar repositories for AIOpsLab
Users that are interested in AIOpsLab are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A list of awesome academic researches and industrial materials about Large Language Model (LLM) and Artificial Intelligence for IT Operat…☆445Feb 21, 2026Updated 3 months ago
- A holistic framework to enable the design, development, and evaluation of autonomous AIOps agents.☆12May 21, 2025Updated last year
- ⚠️ ARCHIVED - All development moved to https://github.com/itbench-hub/ITBench/tree/main/scenarios☆15Feb 24, 2026Updated 3 months ago
- Cloud incidents/failures related work.☆20Jan 7, 2025Updated last year
- [ICLR'25] OpenRCA: Can Large Language Models Locate the Root Cause of Software Failures?☆343Apr 14, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆15Jan 7, 2023Updated 3 years ago
- Multi-modal & multi-domain customer service agent with real time text, voice and soon video☆77Feb 14, 2026Updated 3 months ago
- An open source benchmarking framework for IT automation☆316Updated this week
- ☁️ Benchmarking LLMs for Cloud Config Generation | 云场景下的大模型基准测试☆41Oct 25, 2024Updated last year
- An LLM-based system that fully automates Chaos Engineering (ASE 2025, NIER track)☆28Apr 6, 2026Updated last month
- Collection of slides, repositories, papers about AIOps☆1,533Mar 17, 2026Updated 2 months ago
- Awesome resources for failure diagnosis research.☆60Apr 26, 2026Updated 3 weeks ago
- ☆55Apr 8, 2026Updated last month
- The LLMAgentOps Toolkit is a repository that provides a foundational structure for building LLM Agent-based applications using the Semant…☆17Apr 1, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- End-to-end Generative Optimization for AI Agents☆737Dec 10, 2025Updated 5 months ago
- Code for "LEMMA-RCA: A Large Multi-modal Multi-domain Dataset for Root Cause Analysis" paper☆29Oct 6, 2025Updated 7 months ago
- Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks including C…☆5,558Mar 19, 2026Updated 2 months ago
- Real-Time Intrusion Detection and Prevention with Neural Network in Kernel using eBPF☆25Apr 9, 2024Updated 2 years ago
- 🤖⚡ Streamlining Prior Authorization with AutoAuth Framework and Azure AI☆20Mar 25, 2026Updated 2 months ago
- Code and datasets for FSE'22 paper "Actionable and Interpretable Fault Localization for Recurring Failures in Online Service Systems"☆82Oct 24, 2022Updated 3 years ago
- Task-Aware Agent-driven Prompt Optimization Framework☆3,867Oct 13, 2025Updated 7 months ago
- AutoLog: A Log Sequence Synthesis Framework for Anomaly Detection [ASE'23]☆40Feb 20, 2024Updated 2 years ago
- GAIA, with the full name Generic AIOps Atlas, is an overall dataset for analyzing operation problems such as anomaly detection, log analy…☆275Jun 16, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A repo to accelerate development and testing of GenAI Gateways built with Azure API Management. Includes various capabilities as examples…☆64Jan 29, 2025Updated last year
- Code repository for SRE agent as part of ITBench☆19Sep 9, 2025Updated 8 months ago
- [EMNLP 2024 (Findings)] mABC: multi-Agent Blockchain-inspired Collaboration for root cause analysis in micro-services architecture☆73Dec 31, 2024Updated last year
- Zodiac: Unearthing Semantic Checks for Cloud Infrastructure-as-Code Programs, SOSP 2024☆15Nov 28, 2024Updated last year
- ☆48Jan 11, 2023Updated 3 years ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆33,186Updated this week
- OctoTools: An agentic framework with extensible tools for complex reasoning☆1,462May 2, 2026Updated 3 weeks ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆694Mar 16, 2025Updated last year
- ☆25May 30, 2025Updated 11 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [NeurIPS'25] Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)☆32Aug 8, 2025Updated 9 months ago
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆768Dec 16, 2025Updated 5 months ago
- 2019AIOps: The 2nd match for AIOps☆27Aug 8, 2022Updated 3 years ago
- Creates an Azure AI Service and deploys the specified models.☆18Aug 22, 2025Updated 9 months ago
- The implementation of multimodal observability data root cause analysis approach Nezha in FSE 2023☆70May 20, 2025Updated last year
- A sample demo for building and testing react components and includes a set of unique features including AI component generation and autom…☆15Jun 27, 2024Updated last year
- One-click deploy of a Knowledge Graph powered RAG (GraphRAG) in Azure☆2,414May 27, 2025Updated 11 months ago