Albus-Tan / SJTU-SE3357-OS-notesLinks
SJTU SE3357 操作系统笔记 OS Notes
☆16Updated 2 years ago
Alternatives and similar repositories for SJTU-SE3357-OS-notes
Users that are interested in SJTU-SE3357-OS-notes are comparing it to the libraries listed below
Sorting:
- Paper Reading:涉及分布式、虚拟化、网络、机器学习☆23Updated 4 years ago
- A system which deploys and manages containerized applications. Course project of SJTU SE3356, 2022.☆16Updated 3 years ago
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆35Updated last year
- 应用系统体系架构☆24Updated last year
- The schedule of the seminar☆25Updated 3 years ago
- 📄 📃 papers that I read and noted 🧐☆33Updated last month
- ☆147Updated this week
- Group project of SE3356 Cloud Operating System Design and Practice, Spring 2022.☆26Updated 3 years ago
- Fast OS-level support for GPU checkpoint and restore☆232Updated 2 weeks ago
- distributed consensus protocol's bugs, flaws, deceptive traps, improvements☆120Updated 3 months ago
- system paper reading notes☆246Updated 3 years ago
- A mini version of k8s that implements the abstraction of pod, service, auto-scaling, replicaSet and provides DNS, GPU and serverless serv…☆16Updated 2 years ago
- SCV is a distributed cluster GPU sniffer. SCV是一个分布式GPU嗅探器☆21Updated 2 years ago
- A curated list of awesome serverless research works, including papers and open-sourced projects.☆87Updated 2 years ago
- A mini container orchestration tool similar to k8s.☆35Updated 3 years ago
- A distributed in-memory store for temporal knowledge graphs☆10Updated last year
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆29Updated 5 months ago
- NVIDIA device plugin for Kubernetes☆15Updated 5 years ago
- A website providing info for self-learners who want to explore the world of operating systems. The website template is from https://githu…☆51Updated 4 years ago
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆32Updated 2 weeks ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆55Updated 3 years ago
- Minik8s For SJTU-CloudOS Course.☆55Updated 2 years ago
- KV cache store for distributed LLM inference☆314Updated 2 months ago
- Automatic tuning for ML model deployment on Kubernetes☆80Updated 9 months ago
- ☆16Updated 2 years ago
- ☆33Updated this week
- A scheduling framework for multitasking over diverse XPUs, including GPUs, NPUs, ASICs, and FPGAs☆94Updated last week
- This repository contains statistics about the AI Infrastructure products.☆17Updated 6 months ago
- Curve meetup slides☆42Updated last year
- OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)☆245Updated last week