Sky Computing: Accelerating Geo-distributed Computing in Federated Learning
☆90 · Nov 22, 2022 · Updated 3 years ago
Alternatives and similar repositories for SkyComputing
Users who are interested in SkyComputing are comparing it to the libraries listed below
- Performance benchmarking with ColossalAI ☆38 · Jul 6, 2022 · Updated 3 years ago
- Scalable PaLM implementation of PyTorch ☆190 · Dec 19, 2022 · Updated 3 years ago
- An external memory allocator example for PyTorch. ☆16 · Aug 10, 2025 · Updated 6 months ago
- Examples of training models with hybrid parallelism using ColossalAI ☆339 · Mar 23, 2023 · Updated 2 years ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class. ☆10 · Feb 10, 2022 · Updated 4 years ago
- Large-scale model inference. ☆627 · Sep 12, 2023 · Updated 2 years ago
- ☆24 · Nov 22, 2022 · Updated 3 years ago
- GPT4 based personalized ArXiv paper assistant bot ☆12 · Mar 1, 2024 · Updated 2 years ago
- GPT Demo with hybrid distributed training ☆10 · Dec 1, 2022 · Updated 3 years ago
- ☆11 · Mar 13, 2023 · Updated 2 years ago
- https://nnsmith-asplos.rtfd.io Artifact of "NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers" ASPLOS'23 ☆11 · Mar 29, 2023 · Updated 2 years ago
- An evaluation framework for data center traffic engineering. ☆13 · Jul 28, 2024 · Updated last year
- Benchmarking Machine Learning Model Inference in Data Streaming Solutions ☆10 · Jun 12, 2024 · Updated last year
- JNumberTools is an open-source Java library for solving complex problems in combinatorics and number theory. Whether you're a researcher,… ☆12 · May 13, 2025 · Updated 9 months ago
- Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation ☆19 · Jun 11, 2025 · Updated 8 months ago
- A Sparse-tensor Communication Framework for Distributed Deep Learning ☆13 · Nov 1, 2021 · Updated 4 years ago
- A mini Python tool for NAT traversal (intranet penetration) and reverse/forward proxying, implemented in under 100 lines of code ☆12 · May 27, 2023 · Updated 2 years ago
- Mitigating Routing Update Overhead for Traffic Engineering by Combining Destination-based Routing with Reinforcement Learning ☆15 · Oct 16, 2022 · Updated 3 years ago
- This repository compiles a list of papers/resources related to the graph retrieval-augmented generation! Star⭐ the repo and follow me if … ☆11 · Dec 7, 2024 · Updated last year
- Store the ATD/openapi/protobuf/... interfaces between semgrep components ☆18 · Updated this week
- [ICDCS 2023] DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining ☆12 · Dec 4, 2023 · Updated 2 years ago
- GPU-accelerated LLM Training Simulator ☆17 · Jun 26, 2025 · Updated 8 months ago
- A deep learning-driven scheduler for elastic training in deep learning clusters ☆31 · Jan 14, 2021 · Updated 5 years ago
- A memory efficient DLRM training solution using ColossalAI ☆107 · Nov 22, 2022 · Updated 3 years ago
- ☆39 · Oct 3, 2022 · Updated 3 years ago
- Benchmark PyTorch Custom Operators ☆14 · Jul 6, 2023 · Updated 2 years ago
- GPU-scheduler-for-deep-learning ☆210 · Nov 5, 2020 · Updated 5 years ago
- Arya: Arbitrary Graph Pattern Mining with Decomposition-based Sampling ☆16 · Sep 27, 2023 · Updated 2 years ago
- [Usenix Security '25] Robustifying ML-powered Network Classifiers with PANTS ☆20 · Aug 16, 2025 · Updated 6 months ago
- ☆14 · Nov 7, 2025 · Updated 3 months ago
- Wraps the NVDLA project for Chipyard integration ☆22 · Sep 2, 2025 · Updated 6 months ago
- Desktop version of ChatGPT; supports manually setting the cookie ☆19 · Dec 9, 2022 · Updated 3 years ago
- Official repository for "QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices" (IPDPS '24). ☆20 · Feb 23, 2024 · Updated 2 years ago
- Manages vllm-nccl dependency ☆17 · Jun 3, 2024 · Updated last year
- ☆21 · May 13, 2022 · Updated 3 years ago
- Selected Topics in Computer Networks @ Johns Hopkins University ☆19 · Dec 17, 2020 · Updated 5 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo ☆17 · Mar 13, 2023 · Updated 2 years ago
- ☆65 · Apr 30, 2025 · Updated 10 months ago
- ☆20 · Jun 3, 2023 · Updated 2 years ago