shady1543 / eACGM
[IWQoS 2025] eACGM: An eBPF-based Automated Comprehensive Governance and Monitoring framework for AI/ML systems.
☆17 Updated 3 weeks ago
Alternatives and similar repositories for eACGM
Users who are interested in eACGM are comparing it to the repositories listed below.
- An eBPF kernel Observable Agent To Spy Performance Issue On OS. ☆13 Updated 7 months ago
- Probe TCP metrics and latencies from the kernel with BCC ☆30 Updated 6 years ago
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads. ☆32 Updated 3 weeks ago
- Live upgrade Linux kernel scheduler subsystem ☆88 Updated 2 years ago
- ☆52 Updated last month
- [NSDI '24] DINT: Fast In-Kernel Distributed Transactions with eBPF ☆45 Updated last year
- Lightweight daemon for monitoring CUDA runtime API calls with eBPF uprobes ☆126 Updated 5 months ago
- A collection of CUDA programming examples to learn GPU programming ☆25 Updated 3 months ago
- Real-Time Intrusion Detection and Prevention with Neural Network in Kernel using eBPF ☆19 Updated last year
- A storage plugin that provides CRI-O/Podman with the ability to lazy mount nydus images. ☆38 Updated 3 months ago
- An In-kernel Transparent Monitoring System for Microservice Systems with eBPF ☆21 Updated 2 years ago
- FaaSNet: Scalable and Fast Provisioning of Custom Serverless Container Runtimes at Alibaba Cloud Function Compute (USENIX ATC'21) ☆55 Updated 3 years ago
- XDP Deployments in Userspace eBPF ☆15 Updated last month
- This repository contains experimental tools we developed to forecast a cluster's resource (CPU or memory) usage. ☆42 Updated 4 years ago
- Serverless Paper Reading and Discussion ☆37 Updated 2 years ago
- ☆33 Updated last week
- eBPF sockops samples for performance optimization ☆70 Updated last year
- An OS kernel module for fast **remote** fork using advanced datacenter networking (RDMA). ☆64 Updated 6 months ago
- Kernel Extensions Large Language Model Agent ☆30 Updated last year
- A tool to detect infrastructure issues on cloud native AI systems ☆47 Updated last month
- Interference-aware CPU scheduling that enables performance isolation and high CPU utilization for datacenter servers ☆160 Updated 3 weeks ago
- Kernel profiler based on perf_event and ebpf ☆100 Updated last month
- Tools for use with AF_SMC sockets ☆22 Updated last month
- ☆20 Updated 10 months ago
- Compiler plugin for performance analysis of HIP applications ☆12 Updated 4 months ago
- eBPF Standard Documentation ☆48 Updated 11 months ago
- ☆23 Updated 3 months ago
- TraceWeaver is a research prototype for transparently tracing requests through a microservice without application instrumentation. ☆20 Updated last year
- XRP: In-Kernel Storage Functions with eBPF ☆232 Updated 2 years ago
- A high performance ACL based on XDP. GPL-2.0 License. ☆15 Updated 2 years ago