Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
☆90Nov 22, 2022Updated 3 years ago
Alternatives and similar repositories for SkyComputing
Users that are interested in SkyComputing are comparing it to the libraries listed below.
- Performance benchmarking with ColossalAI☆39Jul 6, 2022Updated 3 years ago
- Scalable PaLM implementation in PyTorch☆190Dec 19, 2022Updated 3 years ago
- Examples of training models with hybrid parallelism using ColossalAI☆339Mar 23, 2023Updated 3 years ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- ☆24Nov 22, 2022Updated 3 years ago
- Large-scale model inference.☆629Sep 12, 2023Updated 2 years ago
- A collection of models built with ColossalAI☆33Nov 22, 2022Updated 3 years ago
- An IR for efficiently simulating distributed ML computation.☆33Jan 13, 2024Updated 2 years ago
- [Usenix Security '25] Robustifying ML-powered Network Classifiers with PANTS☆21Aug 16, 2025Updated 9 months ago
- GPT Demo with hybrid distributed training☆10Dec 1, 2022Updated 3 years ago
- ☆30Sep 4, 2023Updated 2 years ago
- Desktop version of ChatGPT, with support for manually setting the cookie☆19Dec 9, 2022Updated 3 years ago
- An evaluation framework for data center traffic engineering.☆14Jul 28, 2024Updated last year
- A Python library transfers PyTorch tensors between CPU and NVMe☆124Nov 27, 2024Updated last year
- ☆12Mar 13, 2023Updated 3 years ago
- A memory efficient DLRM training solution using ColossalAI☆108Nov 22, 2022Updated 3 years ago
- Python3 auto-active verification library (migrated to an Intel project)☆24Apr 7, 2022Updated 4 years ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆13Mar 7, 2024Updated 2 years ago
- Official repository for "IPDPS '24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated 2 years ago
- GPU-scheduler-for-deep-learning☆212Nov 5, 2020Updated 5 years ago
- A mini Python tool for NAT traversal and reverse/forward proxying, implemented in under 100 lines of code☆12May 27, 2023Updated 3 years ago
- This repository compiles a list of papers/resources related to the graph retrieval-augmented generation! Star⭐ the repo and follow me if …☆10Dec 7, 2024Updated last year
- High performance distributed framework for training deep learning recommendation models based on PyTorch.☆412Updated this week
- Manages vllm-nccl dependency☆18Jun 3, 2024Updated 2 years ago
- https://nnsmith-asplos.rtfd.io Artifact of "NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers" ASPLOS'23☆11Mar 29, 2023Updated 3 years ago
- ☆28Jul 11, 2021Updated 4 years ago
- a deep learning-driven scheduler for elastic training in deep learning clusters☆31Jan 14, 2021Updated 5 years ago
- Self Supervised Learning for Time Series Using Similarity Distillation☆11Jun 29, 2022Updated 3 years ago
- Source code for the paper titled: "Unlocking the full potential of smart charging: Addressing paused and delayed charging problems in ele…☆11May 22, 2024Updated 2 years ago
- Reading seminar in Harvard Cloud Networking and Systems Group☆16Aug 29, 2022Updated 3 years ago
- SPATL: Salient Parameter Aggregation and Transfer Learning for Heterogeneous Federated Learning☆24Nov 17, 2022Updated 3 years ago
- Mitigating Routing Update Overhead for Traffic Engineering by Combining Destination-based Routing with Reinforcement Learning☆15Oct 16, 2022Updated 3 years ago
- A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters☆58May 3, 2026Updated last month
- Benchmarking Machine Learning Model Inference in Data Streaming Solutions☆10Jun 12, 2024Updated 2 years ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆55Jul 3, 2022Updated 3 years ago
- ☆23Jan 7, 2022Updated 4 years ago
- ☆18May 3, 2024Updated 2 years ago
- A Gateway API (https://gateway-api.sigs.k8s.io) implementation, built on top of pipy.☆13Nov 14, 2025Updated 6 months ago
- Tiny Calculator with support of +, -, *, /, ^, sin, cos, tan...☆10Apr 2, 2024Updated 2 years ago