A Lightweight LLM Inference Performance Simulator
☆62Mar 2, 2026Updated last week
Alternatives and similar repositories for InferSim
Users that are interested in InferSim are comparing it to the libraries listed below
Sorting:
- The Chemical Reaction Optimization (CRO) algorithm with dependent classes in python 3.☆11Apr 21, 2020Updated 5 years ago
- parquet dedupe estimator☆25Feb 20, 2026Updated 2 weeks ago
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading☆13Jun 28, 2025Updated 8 months ago
- a model zoo☆11Jul 19, 2017Updated 8 years ago
- COSE: Configuring Serverless Functions using Statistical Learning☆10Jun 28, 2023Updated 2 years ago
- Bagua tutorials.☆13Sep 4, 2022Updated 3 years ago
- Slowdown prediction module of Echo: Simulating Distributed Training at Scale☆13May 17, 2025Updated 9 months ago
- ☆11Apr 12, 2020Updated 5 years ago
- GPU topology-aware scheduler☆13Jul 7, 2017Updated 8 years ago
- Intelligent Resource Requirement Estimation and Scheduling for Deep Learning Jobs on Distributed GPU Clusters☆15Nov 18, 2021Updated 4 years ago
- An OpenCV Android Camera Sudoku Solver☆11Apr 7, 2017Updated 8 years ago
- Android 人脸检测 android.media, play service, Face++☆11Aug 13, 2016Updated 9 years ago
- A RNN-based solver for the popular word game☆14Oct 21, 2023Updated 2 years ago
- TPAMI 2025 Survey Paper☆25Mar 31, 2025Updated 11 months ago
- survery of small language models☆18Jul 23, 2024Updated last year
- CCFDF rebar detection☆14Jun 14, 2019Updated 6 years ago
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Jun 30, 2023Updated 2 years ago
- A construction kit for reinforcement learning environment management.☆363Updated this week
- ☆16Feb 5, 2024Updated 2 years ago
- ☆18Oct 31, 2025Updated 4 months ago
- Open source version of DOCA GPUNetIO and DOCA Verbs libraries (limited features) to enable GDAKI technology on RDMA (IB and RoCE)☆32Feb 27, 2026Updated last week
- PyTorch-UVM on super-large language models.☆17Dec 21, 2020Updated 5 years ago
- [TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Om Chabra, Arash Nasr-Esfahany, Kevin Zhao, Pratees…☆17Nov 18, 2025Updated 3 months ago
- ☆16Apr 22, 2025Updated 10 months ago
- Geometry style transfer colorbook☆20Jan 5, 2024Updated 2 years ago
- Differencing based Self-supervised pretraining for scene change detection☆23Feb 15, 2023Updated 3 years ago
- A simple implementation of Tensorrt PPYOLOE☆17Jul 5, 2022Updated 3 years ago
- 语雀 Claude Code Plugin — 一键集成语雀 AI 能力☆56Feb 26, 2026Updated last week
- 仿抖音上下滑动列表播放短视频解决方案☆14Nov 25, 2019Updated 6 years ago
- Solution of the 3rd place in the 3rd YouTube-8M Video Understanding Challenge☆16Feb 28, 2020Updated 6 years ago
- Implementation of SPW and DPW for Monte Carlo Tree Search in Continuous action/state space☆20Oct 3, 2023Updated 2 years ago
- ☆19Sep 30, 2017Updated 8 years ago
- 🌞 a weather application on iOS (follow Yahoo Weather)☆20May 28, 2018Updated 7 years ago
- utility code for doing deep nlp in torch☆17May 16, 2017Updated 8 years ago
- ☆20Oct 5, 2022Updated 3 years ago
- Offline optimization of your disaggregated Dynamo graph☆199Updated this week
- A collection of LaTeX packages by Peter Wilson☆38Mar 15, 2025Updated 11 months ago
- A simple but well-performing "single-hop" visual attention model for the GQA dataset☆20Aug 8, 2019Updated 6 years ago
- ☆21Jun 21, 2018Updated 7 years ago