jasperzhong / swiftView external linksLinks
☆15Apr 20, 2022Updated 3 years ago
Alternatives and similar repositories for swift
Users that are interested in swift are comparing it to the libraries listed below
Sorting:
- ☆56Jan 25, 2021Updated 5 years ago
- ☆13Feb 22, 2023Updated 2 years ago
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- A Generic Resource-Aware Hyperparameter Tuning Execution Engine☆15Jan 8, 2022Updated 4 years ago
- Mu: Microsecond Consensus for Microsecond Applications☆41Oct 12, 2020Updated 5 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated last year
- Universal Presentation: A Header-only C++ Library to Cout STL containers and more☆18Aug 14, 2023Updated 2 years ago
- ☆38Jan 15, 2021Updated 5 years ago
- ☆44Jul 4, 2024Updated last year
- Primo: Practical Learning-Augmented Systems with Interpretable Models☆19Dec 26, 2023Updated 2 years ago
- ☆44Sep 6, 2021Updated 4 years ago
- ☆23Jan 7, 2022Updated 4 years ago
- Surrogate-based Hyperparameter Tuning System☆28Jun 29, 2023Updated 2 years ago
- ☆22Nov 7, 2018Updated 7 years ago
- [ACM EuroSys 2023] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access☆56Aug 6, 2025Updated 6 months ago
- Privacy Budget Orchestration in Machine Learning Workloads (OSDI '21)☆26Oct 20, 2023Updated 2 years ago
- Lucid: A Non-Intrusive, Scalable and Interpretable Scheduler for Deep Learning Training Jobs☆58May 21, 2023Updated 2 years ago
- ☆27May 31, 2023Updated 2 years ago
- ☆29Oct 27, 2023Updated 2 years ago
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**.☆28Apr 25, 2023Updated 2 years ago
- 📑 Dive into Big Model Training☆116Dec 1, 2022Updated 3 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆34Feb 10, 2025Updated last year
- ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.☆27Jul 6, 2023Updated 2 years ago
- Analysis for the traces from byteprofile☆32Nov 21, 2023Updated 2 years ago
- Training neural networks in TensorFlow 2.0 with 5x less memory☆137Feb 21, 2022Updated 3 years ago
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …☆36Aug 29, 2025Updated 5 months ago
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆34Nov 11, 2023Updated 2 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆35Jan 9, 2023Updated 3 years ago
- Carbon Explorer helps evaluating solutions make datacenters operate on renewable energy.☆87Nov 8, 2024Updated last year
- Dirigent: Lightweight Serverless Orchestration☆41Aug 26, 2025Updated 5 months ago
- A schedule language for large model training☆152Aug 21, 2025Updated 5 months ago
- Sparse symmetric indefinite solver implemented with a runtime system☆13May 11, 2020Updated 5 years ago
- Edge Impulse FOMO Implementation from scratch☆15Jul 25, 2025Updated 6 months ago
- SOTA Learning-augmented Systems☆37May 21, 2022Updated 3 years ago
- Reinforcement Learning (PPO) applied to a multiplayer simple card game (Witches)☆10Jun 7, 2020Updated 5 years ago
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆164Jan 12, 2026Updated last month
- This simulator models multi core systems, intended primarily for studies on main memory management techniques. It models a trace-based ou…☆12Jan 18, 2016Updated 10 years ago
- 基于AnimeGAN2+serverless+NAS存储的漫画风图片生成工具(demo 已失效)☆12May 11, 2022Updated 3 years ago