yuanxinnn / APTMoE
☆12Jun 29, 2024Updated last year
Alternatives and similar repositories for APTMoE
Users who are interested in APTMoE are comparing it to the libraries listed below.
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆56May 29, 2024Updated last year
- HeliosArtifact☆22Sep 27, 2022Updated 3 years ago
- [NeurIPS 2025] ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive☆66Dec 11, 2025Updated 2 months ago
- PyTorch library for cost-effective, fast and easy serving of MoE models.☆280Feb 2, 2026Updated 2 weeks ago
- Repo for transient training paper at ICAC 2019.☆11Oct 5, 2022Updated 3 years ago
- ☆10Mar 8, 2025Updated 11 months ago
- NUST-API collection☆10Oct 29, 2018Updated 7 years ago
- This is a command line interface for the Rec Cloud Service (rec.ustc.edu.cn)☆15Oct 24, 2025Updated 3 months ago
- ☆11Apr 10, 2025Updated 10 months ago
- ☆12Sep 28, 2024Updated last year
- ☆12Aug 18, 2023Updated 2 years ago
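- Reproduction of the libsmctrl paper, with an added Python interface that can be called flexibly from Python to allocate compute resources☆12May 21, 2024Updated last year
- Golang version of the WeChat iPad protocol, implemented on top of gRPC. This code needs a gRPC server to pack and unpack messages before it can be used normally☆12Jul 8, 2019Updated 6 years ago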
- A standalone CXL-enabled system simulator.☆18Jan 10, 2026Updated last month
- This is the final project of 2020 DBMS course in SYSU☆10Jun 23, 2020Updated 5 years ago
- a simple API to use CUPTI☆11Aug 19, 2025Updated 5 months ago
- ☆15Nov 11, 2024Updated last year
- ☆12Apr 6, 2025Updated 10 months ago
- Integrating Event-based Dynamic Vision Sensors with Sparse Hyperdimensional Computing☆11Jul 9, 2020Updated 5 years ago
- ☆14Dec 14, 2025Updated 2 months ago
- Course Project for High Level Chip Design (高层次芯片设计)☆17Jan 2, 2025Updated last year
- ☆10Sep 14, 2023Updated 2 years ago
- ☆13Jan 13, 2025Updated last year
- Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".☆21May 23, 2025Updated 8 months ago
- Compiler for Dynamic Neural Networks☆45Nov 13, 2023Updated 2 years ago
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated last year
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- Using AMQP as a Queuing Backend for Web Apps Should Be Easy☆111Dec 27, 2009Updated 16 years ago
- Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]☆46Nov 24, 2022Updated 3 years ago
- Barge running on xhyve hypervisor☆15Jun 7, 2022Updated 3 years ago
- Work in progress LLM framework.☆15Oct 31, 2024Updated last year
- We present a set of all-reduce compatible gradient compression algorithms which significantly reduce the communication overhead while mai…☆10Nov 14, 2021Updated 4 years ago
- SAML2 authenticating proxy☆10Jul 28, 2014Updated 11 years ago
- Utility that converts from PDF to EPUB format☆16Apr 4, 2010Updated 15 years ago
- Python implementation of a Genetic Algorithm for the Resource-Constrained Project Scheduling Problem☆14May 29, 2023Updated 2 years ago
- Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".☆19Oct 30, 2024Updated last year
- ☆11Dec 18, 2020Updated 5 years ago
- Generative Models for Image Captioning☆10Jun 7, 2017Updated 8 years ago
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago