[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo
☆63Aug 5, 2025Updated 6 months ago
Alternatives and similar repositories for Ayo
Users that are interested in Ayo are comparing it to the libraries listed below
Sorting:
- Slowdown prediction module of Echo: Simulating Distributed Training at Scale☆13May 17, 2025Updated 9 months ago
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆210Sep 21, 2024Updated last year
- [TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Om Chabra, Arash Nasr-Esfahany, Kevin Zhao, Pratees…☆16Nov 18, 2025Updated 3 months ago
- ☆19May 10, 2025Updated 9 months ago
- ☆21Nov 13, 2025Updated 3 months ago
- A large-scale simulation framework for LLM inference☆539Jul 25, 2025Updated 7 months ago
- A repository for the Kramabench benchmark☆39Updated this week
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆25May 12, 2025Updated 9 months ago
- Surrogate-based Hyperparameter Tuning System☆28Jun 29, 2023Updated 2 years ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"☆77Oct 15, 2025Updated 4 months ago
- NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading☆84Jun 16, 2025Updated 8 months ago
- The wafer-native AI accelerator simulation platform and inference engine.☆50Jan 1, 2026Updated 2 months ago
- ☆59Dec 4, 2025Updated 2 months ago
- Data repository of NAssim☆29Aug 18, 2022Updated 3 years ago
- A low-latency & high-throughput serving engine for LLMs☆480Jan 8, 2026Updated last month
- LLM Serving Performance Evaluation Harness☆83Feb 25, 2025Updated last year
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs☆11Jan 13, 2023Updated 3 years ago
- Text preprocessing package for use in NLP tasks https://pypi.org/project/textcl/☆11Aug 9, 2024Updated last year
- Lightweight framework for 3D rendering.☆11Jun 5, 2023Updated 2 years ago
- Crater Backend is the web backend of Crater System.☆22Nov 2, 2025Updated 3 months ago
- ☆94Jul 3, 2022Updated 3 years ago
- This repo implements an interface to GTAV for SCENIC language.☆11Dec 7, 2019Updated 6 years ago
- Efficient Hyper-parameter Tuning at Scale (VLDB'22)☆10Dec 1, 2021Updated 4 years ago
- ☆16Jan 14, 2025Updated last year
- Source code for the paper: "Pantheon: Preemptible Multi-DNN Inference on Mobile Edge GPUs"☆16Apr 15, 2024Updated last year
- ☆12Nov 8, 2024Updated last year
- Memory-mapped VGA display for Xilinx/Zynq/Zedboard, with demo code for using it.☆15Feb 26, 2018Updated 8 years ago
- INFINEL: An efficient GPU-based processing method for unpredictable large output graph queries [PPoPP'24]☆10Jan 15, 2024Updated 2 years ago
- Offline RandomAPI npm module☆12Apr 22, 2018Updated 7 years ago
- A platform that provides users with easy access to AI services developed by Montimage and usage of explainable AI techniques (e.g., LIME,…☆10Feb 17, 2026Updated last week
- A Distributed Analysis and Benchmarking Framework for Apache OpenWhisk Serverless Platform☆12Dec 11, 2018Updated 7 years ago
- triton ver of gqa flash attn, based on the tutorial☆12Aug 4, 2024Updated last year
- Elastic computing platform☆30Feb 15, 2026Updated 2 weeks ago
- Code for SIGKDD2025 paper: An Efficient Diffusion-based Non-Autoregressive Solver for Traveling Salesman Problem☆14Jan 28, 2025Updated last year
- SJTU 中文简约 LaTeX 报告模板☆10Jun 7, 2021Updated 4 years ago
- A TUI signal waveform viewer.☆23Mar 27, 2025Updated 11 months ago
- A Benchmark for Transactional Database Performance Anomalies☆12Nov 21, 2023Updated 2 years ago
- 实时交互输入辅助工具☆10Apr 7, 2022Updated 3 years ago
- SplitBud is a Split Learning framework built upon Flower☆14Mar 22, 2025Updated 11 months ago