SCV is a distributed cluster GPU sniffer.
☆20Feb 25, 2023Updated 3 years ago
Alternatives and similar repositories for SCV
Users who are interested in SCV are comparing it to the libraries listed below.
- NodeSimulator simulates node resources and state in Kubernetes, as well as pod state.☆11Nov 7, 2021Updated 4 years ago
- Yoda is a Kubernetes scheduler based on GPU metrics.☆137Mar 27, 2022Updated 4 years ago
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆33Nov 11, 2023Updated 2 years ago
- ☆131Apr 19, 2021Updated 5 years ago
- ☆12Nov 21, 2017Updated 8 years ago
- Discover and pre-pull Docker images on Kubernetes nodes to speed up containers bootstrap and autoscaling☆20Mar 28, 2023Updated 3 years ago
- Transparent checkpoint/restart library for CUDA application.☆12Mar 9, 2015Updated 11 years ago
- ☆24Jun 12, 2023Updated 2 years ago
- A LaTeX beamer theme template for UCAS students.☆12Apr 21, 2024Updated 2 years ago
- Helios Traces from SenseTime☆63Sep 27, 2022Updated 3 years ago
- elastic-gpu-agent is a Kubernetes device plugin for GPU resources allocation on node.☆54Jul 27, 2022Updated 3 years ago
- Final exam review for the Nanjing University fall 2024 graduate course on distributed systems☆18Jan 3, 2025Updated last year
- RESPECT: Reinforcement Learning based Edge Scheduling on Pipelined Coral Edge TPUs (DAC'23)☆11Apr 13, 2023Updated 3 years ago
- Learn Rust through systems programming☆10Mar 8, 2022Updated 4 years ago
- GPU topology-aware scheduler☆13Jul 7, 2017Updated 8 years ago
- Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]☆46Nov 24, 2022Updated 3 years ago
- Splits a single Nvidia GPU into multiple partitions with complete compute and memory isolation (with respect to performance) between the partitions☆164Apr 21, 2019Updated 7 years ago
- ☆21Jul 11, 2024Updated last year
- ☆19Jan 27, 2025Updated last year
- ☆53Dec 26, 2024Updated last year
- SJTU HPC open-source project: Spackenv (Spack ENVironment) switches environments between sysadmins, users and developers.☆22Jan 4, 2022Updated 4 years ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆13Mar 7, 2024Updated 2 years ago
- This repo is a sample for Kubernetes scheduler framework.☆47Oct 9, 2021Updated 4 years ago
- Custom schedulers implemented on the Kubernetes scheduler framework (scheduler plugins built on the k8s scheduling framework, used to extend scheduling logic)☆20Nov 30, 2022Updated 3 years ago
- Verification and optimization tool for concurrent code☆28Jul 29, 2025Updated 10 months ago
- FalkorDB port to Rust☆13Mar 26, 2026Updated 2 months ago
- Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs☆60May 21, 2023Updated 3 years ago
- Reading paper list for iCloud group☆14May 3, 2026Updated last month
- Mainly PPT and PDF files, collected for easy management☆14Aug 20, 2024Updated last year
- MoCo: A One-Stop Shop for Model Collaboration Research☆56May 27, 2026Updated last week
- Eagle is a lightweight and intelligent p2p based docker image distribution system.☆38Jan 27, 2021Updated 5 years ago
- Intercepting CUDA runtime calls with LD_PRELOAD☆43Mar 11, 2014Updated 12 years ago
- SmartFD: Efficient and Scalable Functional Dependency Discovery on Distributed Data-Parallel Platforms☆18Aug 23, 2018Updated 7 years ago
- Personal repo for random Go stuff☆15Jul 11, 2019Updated 6 years ago
- Paper Reading: covers distributed systems, virtualization, networking, and machine learning☆22Sep 27, 2020Updated 5 years ago
- Open Service Broker Implementation Based on the Crunchy PostgreSQL Operator☆13Feb 15, 2023Updated 3 years ago
- ☆331Jan 22, 2024Updated 2 years ago
- Network Contention-Aware Cluster Scheduling with Reinforcement Learning (IEEE ICPADS'23)☆20Jul 8, 2025Updated 11 months ago
- Rafiki is a distributed system that supports training and deployment of machine learning models using AutoML, built with ease-of-use in …☆34Dec 11, 2022Updated 3 years ago