Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
☆90Nov 22, 2022Updated 3 years ago
Alternatives and similar repositories for SkyComputing
Users that are interested in SkyComputing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Performance benchmarking with ColossalAI☆39Jul 6, 2022Updated 3 years ago
- Scalable PaLM implementation of PyTorch☆190Dec 19, 2022Updated 3 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 10 months ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- Large-scale model inference.☆629Sep 12, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation☆21Jun 11, 2025Updated last year
- A collection of models built with ColossalAI☆33Nov 22, 2022Updated 3 years ago
- An IR for efficiently simulating distributed ML computation.☆33Jan 13, 2024Updated 2 years ago
- [Usenix Security '25] Robustifying ML-powered Network Classifiers with PANTS☆22Aug 16, 2025Updated 10 months ago
- Elixir: Train a Large Language Model on a Small GPU Cluster☆16Jun 8, 2023Updated 3 years ago
- ☆30Sep 4, 2023Updated 2 years ago
- Desktop version of ChatGPT, support manually set cookie☆19Dec 9, 2022Updated 3 years ago
- An evaluation framework for data center traffic engineering.☆14Jul 28, 2024Updated last year
- A Python library transfers PyTorch tensors between CPU and NVMe☆124Nov 27, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Mar 13, 2023Updated 3 years ago
- A memory efficient DLRM training solution using ColossalAI☆108Nov 22, 2022Updated 3 years ago
- Python3 auto-active verification library (migrated to an Intel project)☆24Apr 7, 2022Updated 4 years ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆13Mar 7, 2024Updated 2 years ago
- Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated 2 years ago
- GPU-scheduler-for-deep-learning☆213Nov 5, 2020Updated 5 years ago
- This repository compiles a list of papers/resources related to the graph retrieval-augmented generation! Star⭐ the repo and follow me if …☆10Dec 7, 2024Updated last year
- High performance distributed framework for training deep learning recommendation models based on PyTorch.☆414Jun 22, 2026Updated last week
- Manages vllm-nccl dependency☆18Jun 3, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- https://nnsmith-asplos.rtfd.io Artifact of "NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers" ASPLOS'23☆11Mar 29, 2023Updated 3 years ago
- ☆28Jul 11, 2021Updated 4 years ago
- Source code for the paper titled: "Unlocking the full potential of smart charging: Addressing paused and delayed charging problems in ele…☆11May 22, 2024Updated 2 years ago
- Reading seminar in Harvard Cloud Networking and Systems Group☆16Aug 29, 2022Updated 3 years ago
- SPATL: Salient Prameter Aggregation and Transfer Learning for Heterogeneous Federated Learning☆24Nov 17, 2022Updated 3 years ago
- Mitigating Routing Update Overhead for Traffic Engineering by Combining Destination-based Routing with Reinforcement Learning☆15Oct 16, 2022Updated 3 years ago
- Benchmarking Machine Learning Model Inference in Data Streaming Solutions☆10Jun 12, 2024Updated 2 years ago
- [TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Om Chabra, Arash Nasr-Esfahany, Kevin Zhao, Pratees…☆21Jun 19, 2026Updated last week
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆55Jul 3, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- pFedDef: Defending Grey-Box Attacks for Personalized Federated Learning☆10May 31, 2023Updated 3 years ago
- ☆23Jan 7, 2022Updated 4 years ago
- ☆18May 3, 2024Updated 2 years ago
- Artifacts accompanying the NSDI '24 paper: Leo: Online ML-based Traffic Classification at Multi-Terabit Line Rate.☆22Jun 13, 2026Updated 2 weeks ago
- Tiny Calculator with support of +, -, *, /, ^, sin, cos, tan...☆10Apr 2, 2024Updated 2 years ago
- GHive: Accelerating Analytical Query Processing in Apache Hive via CPU-GPU Heterogeneous Computing.☆14Nov 8, 2023Updated 2 years ago
- ☆15Jun 2, 2024Updated 2 years ago