shady1543 / eACGMLinks
[IWQoS 2025] eACGM: An eBPF-based Automated Comprehensive Governance and Monitoring framework for AI/ML systems.
☆19Updated 3 months ago
Alternatives and similar repositories for eACGM
Users that are interested in eACGM are comparing it to the libraries listed below
Sorting:
- Lightweight daemon for monitoring CUDA runtime API calls with eBPF uprobes☆140Updated 7 months ago
- An eBPF kernel Observable Agent To Spy Performance Issue On OS.☆13Updated 3 weeks ago
- FaaSNet: Scalable and Fast Provisioning of Custom Serverless Container Runtimes at Alibaba Cloud Function Compute (USENIX ATC'21)☆55Updated 3 years ago
- Probe TCP metrics and latencies from the kernel with BCC☆31Updated 6 years ago
- Real-Time Intrusion Detection and Prevention with Neural Network in Kernel using eBPF☆21Updated last year
- Live upgrade Linux kernel scheduler subsystem☆88Updated 2 years ago
- [NSDI '24] DINT: Fast In-Kernel Distributed Transactions with eBPF☆49Updated last year
- Serverless Paper Reading and Discussion☆38Updated 2 years ago
- eBPF sockops samples for performance optimization☆69Updated last year
- TraceWeaver is a research prototype for transparently tracing requests through a microservice without application instrumentation.☆22Updated last year
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆33Updated last week
- This repository contains experimental tools we developed to forecast a clusters' resource (CPU or memory) usage.☆43Updated 4 years ago
- Repository linking to the software artifacts used for the MigrOS ATC 2021 paper☆18Updated 4 years ago
- An In-kernel Transparent Monitoring System for Microservice Systems with eBPF☆23Updated 3 years ago
- XRP: In-Kernel Storage Functions with eBPF☆234Updated 2 years ago
- ☆37Updated last month
- ☆24Updated 5 months ago
- https://github.com/eunomia-bpf homepage, documents and blogs☆136Updated last week
- ☆106Updated last year
- A storage plugin that provided CRI-O/Podman with the ability to lazy mount nydus images.☆40Updated 6 months ago
- Interference-aware CPU scheduling that enables performance isolation and high CPU utilization for datacenter servers☆179Updated last month
- ☆35Updated 8 months ago
- ☆23Updated 3 weeks ago
- A high performance ACL based on XDP. GPL-2.0 License.☆15Updated 2 years ago
- Kernel profiler based on perf_event and ebpf☆106Updated this week
- A tool to detect infrastructure issues on cloud native AI systems☆51Updated 2 months ago
- A curated list of awesome serverless research works, including papers and open-sourced projects.☆86Updated 2 years ago
- Compiler plugin for performance analysis of HIP applications☆12Updated 7 months ago
- Kernel Extensions Large Language Model Agent☆31Updated last year
- Zero instrucment LLM and AI agent (e.g. claude code, gemini-cli) observability in eBPF☆144Updated 2 weeks ago