CyanStarNight / CloudNativeSim
A toolkit for modeling and simulation of cloud-native applications.
☆11Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for CloudNativeSim
- Real-Time Intrusion Detection and Prevention with Neural Network in Kernel using eBPF☆12Updated 7 months ago
- ☆63Updated last month
- Cloud Native Benchmarking of Foundation Models☆20Updated last week
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆99Updated 8 months ago
- ☆36Updated 4 months ago
- Latest PASTE (NSDI'18) repository☆13Updated 2 years ago
- HydraGen: A Microservice Benchmark Generator☆17Updated last year
- Selected Topics in Computer Networks @ Johns Hopkins University☆19Updated 3 years ago
- LLM Serving Performance Evaluation Harness☆55Updated 2 months ago
- TraceWeaver is a research prototype for transparently tracing requests through a microservice without application instrumentation.☆18Updated 2 months ago
- Awesome-LLM-KV-Cache: A curated list of 📙Awesome LLM KV Cache Papers with Codes.☆96Updated this week
- ☆43Updated last month
- ☆19Updated 10 months ago
- 📰 Must-read papers on KV Cache Compression (constantly updating 🤗).☆123Updated this week
- Predict the performance of LLM inference services☆11Updated 4 months ago
- ☆51Updated last month
- A tool to detect infrastructure issues on cloud native AI systems☆16Updated 2 weeks ago
- ☆9Updated 3 years ago
- Main repository of the BeFaaS project☆14Updated last year
- Stateful LLM Serving☆37Updated 3 months ago
- The source code of INFless,a native serverless platform for AI inference.☆34Updated 2 years ago
- MagicPIG: LSH Sampling for Efficient LLM Generation☆45Updated 2 weeks ago
- THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic Compression☆14Updated 3 months ago
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems☆126Updated 3 weeks ago
- ☆42Updated 5 months ago
- ☆14Updated 3 months ago
- Codebase for Autothrottle (NSDI 2024)☆31Updated 10 months ago
- Intent Driven Orchestration enables management of applications through their Service Level Objectives, while minimizing developer and adm…☆34Updated 2 months ago
- A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of …☆98Updated 4 months ago