☆85Feb 5, 2026Updated 3 months ago
Alternatives and similar repositories for nexus
Users who are interested in nexus are comparing it to the libraries listed below.
- PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications☆127May 9, 2022Updated 3 years ago
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated last year
- Fine-grained GPU sharing primitives☆148Jul 28, 2025Updated 9 months ago
- ☆21May 13, 2022Updated 3 years ago
- ☆23Jan 7, 2022Updated 4 years ago
- ☆38Jun 27, 2025Updated 10 months ago
- ☆53Dec 26, 2024Updated last year
- ☆53Dec 13, 2022Updated 3 years ago
- Tiresias is a GPU cluster manager for distributed deep learning training.☆166May 7, 2020Updated 5 years ago
- a deep learning-driven scheduler for elastic training in deep learning clusters☆31Jan 14, 2021Updated 5 years ago
- ☆144Jan 30, 2025Updated last year
- Code for "Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020☆137Jul 25, 2024Updated last year
- GPU-scheduler-for-deep-learning☆209Nov 5, 2020Updated 5 years ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)☆94Jul 14, 2023Updated 2 years ago
- Artifacts for our ASPLOS'23 paper ElasticFlow☆56May 10, 2024Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆36Jan 9, 2023Updated 3 years ago
- SAF: Streaming Analytics Framework☆30Mar 6, 2019Updated 7 years ago
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆12Mar 7, 2024Updated 2 years ago
- BATCH: Adaptive Batching for Efficient Machine Learning Serving on Serverless Platforms☆11Aug 7, 2021Updated 4 years ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41May 31, 2021Updated 4 years ago
- Model-less Inference Serving☆94Nov 4, 2023Updated 2 years ago
- Multi-party Private Set Intersections & Threshold Set Intersections☆14Apr 2, 2021Updated 5 years ago
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving☆37Dec 27, 2019Updated 6 years ago
- ☆22Nov 20, 2020Updated 5 years ago
- ☆31May 28, 2024Updated last year
- A low-latency prediction-serving system☆1,421Apr 26, 2021Updated 5 years ago
- ☆15Aug 15, 2024Updated last year
- HeliosArtifact☆22Sep 27, 2022Updated 3 years ago
- FilterForward: Scaling Video Analytics on Constrained Edge Nodes☆28Jan 27, 2020Updated 6 years ago
- ☆13Jun 20, 2019Updated 6 years ago
- The NYU Systems Seminar☆24Feb 26, 2024Updated 2 years ago
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆134Feb 22, 2024Updated 2 years ago
- An extension of TVMScript to write simple and high-performance GPU kernels with tensorcore.☆52Jul 23, 2024Updated last year
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆15Sep 18, 2020Updated 5 years ago
- Kubernetes Scheduler for Deep Learning☆263May 22, 2022Updated 3 years ago
- Resource-adaptive cluster scheduler for deep learning training.☆459Mar 5, 2023Updated 3 years ago
- ☆34May 16, 2023Updated 2 years ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆43May 29, 2022Updated 3 years ago