Tjyy-1223 / NeurosurgeonView external linksLinks
云边协同- collaborative inference 📚Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge
☆104Jul 29, 2023Updated 2 years ago
Alternatives and similar repositories for Neurosurgeon
Users that are interested in Neurosurgeon are comparing it to the libraries listed below
Sorting:
- 云边协同- collaborative inference📚Dynamic adaptive DNN surgery for inference acceleration on the edge☆45Jul 30, 2023Updated 2 years ago
- 云边协同- collaborative inference📚工作汇总 📚 Collaborative Inference Work Summary☆97Jan 2, 2025Updated last year
- PyTorch implementation of the paper: Multi-Agent Collaborative Inference via DNN Decoupling: Intermediate Feature Compression and Edge Le…☆45Oct 26, 2023Updated 2 years ago
- A PyTorch Implementation for experiements in paper: Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge.☆17May 29, 2023Updated 2 years ago
- The implementation of paper : RTCoInfer: Real-time Edge-Cloud Collaborative CNN Inference for Stream Analytics on Ubiquitous Images☆17Oct 18, 2022Updated 3 years ago
- Autodidactic Neurosurgeon Collaborative Deep Inference for Mobile Edge Intelligence via Online Learning☆42Aug 14, 2021Updated 4 years ago
- Code for paper "JMDC: A Joint Model and Data Compression System for Deep Neural Networks Collaborative Computing in Edge-Cloud Networks"☆23Aug 24, 2025Updated 5 months ago
- This repository contains some of the codes for paper "Combining DNN partitioning and early exit" published in EdgeSys '22: Proceedings of…☆12Jul 20, 2023Updated 2 years ago
- ☆13Jan 14, 2020Updated 6 years ago
- DNN partition edge-cloud co-infer☆12Jun 11, 2023Updated 2 years ago
- 随着移动云计算和边缘计算的快速发展,以及人工智能的广泛应用,产生了边缘智能(Edge Intelligence)的概念。深度神经网络(例如CNN)已被广泛应用于移动智能应用程序中,但是移动设备有限的存储和计算资源无法满足深度神经网络计算的需求。神经网络压缩与加速技术可以加速…☆308Jul 13, 2022Updated 3 years ago
- ☆23Apr 10, 2023Updated 2 years ago
- Code for paper "Joint Architecture Design and Workload Partitioning for DNN Inference on Industrial IoT Clusters"☆15Aug 22, 2025Updated 5 months ago
- DNN_Partition辅助工具,用于对pytorch模型进行简单的性能分析以及支持模型切分☆14May 31, 2021Updated 4 years ago
- ☆27Feb 25, 2021Updated 4 years ago
- PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices☆37Jan 31, 2024Updated 2 years ago
- [TMC'22] SplitPlace: AI Augmented Splitting and Placement of Large-Scale Neural Networks in Mobile Edge Environments☆21Dec 8, 2022Updated 3 years ago
- PyTorch implementation of the paper: Decomposing Vision Transformers for Collaborative Inference in Edge Devices☆18Jul 27, 2024Updated last year
- ☆26Oct 14, 2023Updated 2 years ago
- ☆13Mar 14, 2023Updated 2 years ago
- 2-stage pruning to favor distributed inference (local device compute half of the model, upload the feature for further computing on stron…☆23May 31, 2018Updated 7 years ago
- Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency☆27Jan 18, 2021Updated 5 years ago
- Code for paper "Locally Distributed Deep Learning Inference on Edge Device Clusters"☆14Aug 22, 2025Updated 5 months ago
- Deep neural network (DNN) implementation for inference tasks☆13Jul 4, 2019Updated 6 years ago
- Code for the paper: "BottleNet++: An End-to-End Approach for Feature Compression in Device-Edge Co-Inference Systems"☆53Sep 5, 2021Updated 4 years ago
- ☆12Nov 16, 2020Updated 5 years ago
- 云边协同,模型推理通信框架☆15Aug 24, 2022Updated 3 years ago
- Edge YOLO paper☆14Aug 4, 2021Updated 4 years ago
- [IEEE TMC 2020] "Computation Offloading in Multi-Access Edge Computing: A Multi-Task Learning Approach" and [IEEE GlobeCom 2023] "A Multi…☆92Apr 15, 2024Updated last year
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters☆88Mar 18, 2020Updated 5 years ago
- ☆71Jul 1, 2023Updated 2 years ago
- Auto-Split: A General Framework of Collaborative Edge-Cloud AI☆14Aug 31, 2021Updated 4 years ago
- Codes for the paper titled Online Joint Task Offloading and Resource Management in Heterogeneous Mobile Edge Environments.☆18Dec 7, 2022Updated 3 years ago
- Code for paper "Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI" (MobiCom'22)☆18Apr 13, 2023Updated 2 years ago
- 2021 Summer Research Internship project (UROP) at Imperial College London. Supervised by Prof George Constantinides and Ben Biggs☆17Dec 17, 2022Updated 3 years ago
- (Code) Multi-objective Sparrow Search Optimization for Task Scheduling in Fog-Cloud-Blockchain Systems☆20Oct 17, 2023Updated 2 years ago
- ☆22Oct 2, 2021Updated 4 years ago
- SustainCluster: A high-fidelity, open-source Gymnasium environment for benchmarking multi-objective, sustainable workload scheduling acro…☆45Jan 12, 2026Updated last month
- TransEdge: Task Offloading with GNN and DRL in Edge Computing-Enabled Transportation System☆43Aug 26, 2024Updated last year