shady1543 / eACGMLinks
[IWQoS 2025] eACGM: An eBPF-based Automated Comprehensive Governance and Monitoring framework for AI/ML systems.
☆19Updated last month
Alternatives and similar repositories for eACGM
Users that are interested in eACGM are comparing it to the libraries listed below
Sorting:
- An eBPF kernel Observable Agent To Spy Performance Issue On OS.☆13Updated 8 months ago
- Lightweight daemon for monitoring CUDA runtime API calls with eBPF uprobes☆130Updated 6 months ago
- A collection of CUDA programming examples to learn GPU programming☆30Updated 4 months ago
- Live upgrade Linux kernel scheduler subsystem☆88Updated 2 years ago
- Probe TCP metrics and latencies from the kernel with BCC☆30Updated 6 years ago
- [NSDI '24] DINT: Fast In-Kernel Distributed Transactions with eBPF☆48Updated last year
- eBPF sockops samples for performance optimization☆71Updated last year
- Official repository of Alibaba erdma drivers☆33Updated 2 months ago
- Real-Time Intrusion Detection and Prevention with Neural Network in Kernel using eBPF☆18Updated last year
- XDP Deployments in Userspace eBPF☆21Updated 2 months ago
- ☆52Updated 2 months ago
- Kernel profiler based on perf_event and ebpf☆104Updated 3 weeks ago
- XRP: In-Kernel Storage Functions with eBPF☆232Updated 2 years ago
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆33Updated last week
- An In-kernel Transparent Monitoring System for Microservice Systems with eBPF☆22Updated 3 years ago
- ☆35Updated this week
- ☆20Updated last week
- Repository linking to the software artifacts used for the MigrOS ATC 2021 paper☆18Updated 4 years ago
- Kernel Extensions Large Language Model Agent☆30Updated last year
- eBPF Standard Documentation☆50Updated last year
- An OS kernel module for fast **remote** fork using advanced datacenter networking (RDMA).☆64Updated 7 months ago
- Generate eBPF programs and tracing with ChatGPT☆254Updated 2 months ago
- Serverless Paper Reading and Discussion☆37Updated 2 years ago
- Health checks for Azure N- and H-series VMs.☆51Updated last month
- TraceWeaver is a research prototype for transparently tracing requests through a microservice without application instrumentation.☆20Updated last year
- Interference-aware CPU scheduling that enables performance isolation and high CPU utilization for datacenter servers☆168Updated last week
- AI/GPU flame graph☆186Updated last week
- ☆64Updated this week
- Tools for use with AF_SMC sockets☆22Updated 2 months ago
- Examples of using BPF ring buffer APIs☆134Updated 4 years ago