Awesome-papers is a collection of awesome papers about cloud computing, including resource management, serverless, microservices, observability, and so on.
☆127 · Dec 23, 2024 · Updated last year
Alternatives and similar repositories for awesome-papers
Users that are interested in awesome-papers are comparing it to the libraries listed below.
- Code and datasets for FSE'22 paper "Actionable and Interpretable Fault Localization for Recurring Failures in Online Service Systems" ☆81 · Oct 24, 2022 · Updated 3 years ago
- MicroRank: End-to-End Latency Issue Localization with Extended Spectrum Analysis in Microservice Environments ☆39 · Jan 3, 2022 · Updated 4 years ago
- A toolkit for hybrid log parsing ☆18 · Aug 23, 2023 · Updated 2 years ago
- A Spatio-Temporal Deep Learning Approach for Unsupervised Anomaly Detection in Cloud Systems (TNNLS) ☆33 · Feb 14, 2022 · Updated 4 years ago
- Cloud incidents/failures related work. ☆19 · Jan 7, 2025 · Updated last year
- The implementation of multimodal observability data root cause analysis approach Nezha in FSE 2023 ☆70 · May 20, 2025 · Updated 10 months ago
- This repository manifests set which is made to build a prototype system of TraceZip, made by 4 pieces. ☆14 · Jul 17, 2025 · Updated 8 months ago
- GAIA, with the full name Generic AIOps Atlas, is an overall dataset for analyzing operation problems such as anomaly detection, log analy… ☆272 · Jun 16, 2023 · Updated 2 years ago
- ☆15 · Jan 7, 2023 · Updated 3 years ago
- TraceRCA ☆16 · May 13, 2022 · Updated 3 years ago
- ☆20 · Nov 10, 2024 · Updated last year
- DyCause is a root cause analysis method for the microservice system failures. ☆44 · Dec 10, 2021 · Updated 4 years ago
- An In-kernel Transparent Monitoring System for Microservice Systems with eBPF ☆22 · Sep 11, 2022 · Updated 3 years ago
- Train Ticket Auto Query Python Scripts ☆31 · Aug 8, 2022 · Updated 3 years ago
- Hipster-Shop with OpenTelemetry ☆23 · Oct 28, 2022 · Updated 3 years ago
- A Large-scale Evaluation for Log Parsing Techniques: How Far are We? [ISSTA'24] ☆138 · Oct 8, 2025 · Updated 6 months ago
- TraceWeaver is a research prototype for transparently tracing requests through a microservice without application instrumentation. ☆23 · Sep 2, 2024 · Updated last year
- Train Ticket - A Benchmark Microservice System ☆19 · Apr 26, 2023 · Updated 2 years ago
- Failure dataset accompanying the paper "How Bad Can a Bug Get? An Empirical Analysis of Software Failures in the OpenStack Cloud Computi… ☆10 · Jun 12, 2020 · Updated 5 years ago
- Train Ticket - A Benchmark Microservice System ☆876 · Nov 21, 2025 · Updated 4 months ago
- Practical Root Cause Localization for Microservice Systems via Trace Analysis. IWQoS 2021 ☆90 · Apr 3, 2023 · Updated 3 years ago
- DeepTraLog: Trace-Log Combined Microservice Anomaly Detection through Graph-based Deep Learning ☆13 · Mar 24, 2023 · Updated 3 years ago
- A curated list of awesome academic researches and industrial materials about Artificial Intelligence for IT Operations (AIOps). ☆309 · Feb 12, 2025 · Updated last year
- ☆66 · Feb 9, 2023 · Updated 3 years ago
- ☆48 · Jan 11, 2023 · Updated 3 years ago
- ☆21 · Nov 14, 2024 · Updated last year
- Predict the performance of LLM inference services ☆23 · Sep 18, 2025 · Updated 6 months ago
- ☆178 · Mar 12, 2024 · Updated 2 years ago
- An LLM-based system that fully automates Chaos Engineering (ASE 2025, NIER track) ☆26 · Apr 6, 2026 · Updated last week
- [FSE'26][WWW'25][ASE'24] RCAEval: A Benchmark for Root Cause Analysis. ☆117 · Apr 2, 2026 · Updated last week
- AIOps (Papers, Tutorials, and Datasets) ☆14 · Feb 8, 2021 · Updated 5 years ago
- A list of awesome academic researches and industrial materials about Large Language Model (LLM) and Artificial Intelligence for IT Operat… ☆427 · Feb 21, 2026 · Updated last month
- MTAD: Tools and Benchmark for Multivariate Time Series Anomaly Detection ☆133 · Dec 18, 2024 · Updated last year
- ☆61 · Feb 7, 2023 · Updated 3 years ago
- The source code for "Unsupervised Anomaly Detection on Microservice Traces through Graph VAE" in WWW2023. ☆26 · May 2, 2023 · Updated 2 years ago
- ☆40 · Oct 25, 2023 · Updated 2 years ago
- [FSE'24 - 🏆 Best Artifact Award] BARO: Robust Root Cause Analysis for Time Series Data. ☆56 · Mar 10, 2026 · Updated last month
- Source code of MicroRCA ☆75 · May 19, 2023 · Updated 2 years ago
- Log Parsing with Prompt-based Few-shot Learning (ICSE 2023, Technical Track) ☆72 · Sep 10, 2025 · Updated 7 months ago