PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices
☆39Jan 31, 2024Updated 2 years ago
Alternatives and similar repositories for PipeEdge
Users who are interested in PipeEdge are comparing it to the libraries listed below.
- PyTorch implementation of the paper: Multi-Agent Collaborative Inference via DNN Decoupling: Intermediate Feature Compression and Edge Le…☆47Oct 26, 2023Updated 2 years ago
- [EMSOFT 2022] Adaptive Edge Offloading for Image Classification Under Rate Limit☆15Jul 17, 2023Updated 2 years ago
- [TMC'22] SplitPlace: AI Augmented Splitting and Placement of Large-Scale Neural Networks in Mobile Edge Environments☆22Dec 8, 2022Updated 3 years ago
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆16Feb 21, 2025Updated last year
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …☆39Aug 29, 2025Updated 8 months ago
- A simulator for large energy-aware fog computing environments 🌱☆10Jul 8, 2021Updated 4 years ago
- Three Agent-Based Simulation for Edge Computing in 5G and Beyond for the recent paper titled "Design and Simulation of a Hybrid Architect…☆20Oct 26, 2021Updated 4 years ago
- Code for paper "JMDC: A Joint Model and Data Compression System for Deep Neural Networks Collaborative Computing in Edge-Cloud Networks"☆25Aug 24, 2025Updated 8 months ago
- Edge computing system for mUAV crowd identification and monitoring.☆11Oct 30, 2021Updated 4 years ago
- 5G-Slicer: An emulator for mobile IoT applications deployed over 5G network slices☆16Apr 28, 2022Updated 4 years ago
- SLO-aware Kubernetes scheduler for the Edge and Cloud☆15Apr 28, 2023Updated 3 years ago
- Cloud-edge collaboration: collaborative inference 📚 Work Summary☆98Jan 2, 2025Updated last year
- Official repository for the paper "Deploying a smart queuing system on edge with Intel OpenVINO toolkit" [Springer-2021]☆14Sep 22, 2020Updated 5 years ago
- Google News Parser☆12Apr 18, 2016Updated 10 years ago
- [DMLR 2024] FedAIoT: A Federated Learning Benchmark for Artificial Intelligence of Things☆59Aug 27, 2024Updated last year
- Multi-branch model for concurrent execution☆18Jun 27, 2023Updated 2 years ago
- Educational tutorials for speech and language processing classes☆12Jan 8, 2019Updated 7 years ago
- ☆16May 3, 2024Updated 2 years ago
- Estimate MFU for DeepSeekV3☆26Jan 5, 2025Updated last year
- ☆14Oct 18, 2023Updated 2 years ago
- WWDC 2020 Swift Student Challenge Submission "6 Feet Between" by Tony Tang☆10Jun 17, 2020Updated 5 years ago
- docker compose outline☆11Apr 22, 2023Updated 3 years ago
- ☆24Aug 15, 2023Updated 2 years ago
- ☆14Aug 2, 2023Updated 2 years ago
- A pytorch implementation of the ICCV2021 workshop paper SimDis: Simple Distillation Baselines for Improving Small Self-supervised Models☆14Jul 15, 2021Updated 4 years ago
- Federated Fairness-aware Recommendation☆14Sep 2, 2022Updated 3 years ago
- Sparse CNN Accelerator targeting Intel FPGA☆14Aug 26, 2021Updated 4 years ago
- ☆13Jun 18, 2019Updated 6 years ago
- bitfusion verilog implementation☆13Feb 21, 2022Updated 4 years ago
- ☆14Oct 6, 2023Updated 2 years ago
- System and Network Laboratory, School of Computer Science and Technology, Beijing Jiaotong University☆25Updated this week
- Python-based modeling and simulation framework for Edge Computing resource management policies☆104May 1, 2026Updated 3 weeks ago
- [CVPR'20] ZeroQ Mixed-Precision implementation (unofficial): A Novel Zero Shot Quantization Framework☆14Dec 16, 2020Updated 5 years ago
- Code needed to reproduce results from my ICLR 2019 paper on fixed-point quantization of the backprop algorithm.☆10Jan 24, 2019Updated 7 years ago
- ☆18Apr 15, 2025Updated last year
- copyright management system using ERC721 token on a blockchain☆11Dec 1, 2020Updated 5 years ago
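- General-purpose CNN (convolutional neural network) accelerator based on Xilinx FPGA; the design targets the KV260 board and is portable to any MpSoC architecture☆21Dec 13, 2024Updated last year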
- ☆13Jan 14, 2020Updated 6 years ago
- Genetic algorithm for solving the hybrid flow shop scheduling problem☆12Mar 25, 2020Updated 6 years ago