a static analytical model for LLM distributed training
☆116Jan 8, 2026Updated last month
Alternatives and similar repositories for SimuMax
Users that are interested in SimuMax are comparing it to the libraries listed below
Sorting:
- An open source SDR SDRAM controller based on the AXI4 bus and verified by FPGA and tapeout. It can support memory particles of different …☆22May 12, 2025Updated 9 months ago
- some knowleage about SystemC/TLM etc.☆28Jun 8, 2023Updated 2 years ago
- This repository is outdated and the related functionality has been migrated to https://github.com/easysoc/easysoc-firrtl☆11Nov 3, 2021Updated 4 years ago
- ☆13May 8, 2025Updated 9 months ago
- ☆12Feb 20, 2026Updated last week
- ☆11Jun 29, 2021Updated 4 years ago
- This is the GUI of X0-Compiler.☆10Sep 21, 2019Updated 6 years ago
- [ASP-DAC 2025] "NeuronQuant: Accurate and Efficient Post-Training Quantization for Spiking Neural Networks" Official Implementation☆15Mar 6, 2025Updated 11 months ago
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 5 months ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆28Aug 19, 2025Updated 6 months ago
- Fork from https://github.com/deepseek-ai/FlashMLA☆16Feb 26, 2025Updated last year
- A synthesis flow for hybrid processing-in-RRAM modes☆12Jul 15, 2021Updated 4 years ago
- MatchLib Connections Toolkit - example designs leveraging Connections☆16Jan 6, 2026Updated last month
- My attempt to improve the speed of the newton schulz algorithm, starting from the dion implementation.☆30Dec 5, 2025Updated 2 months ago
- Analysis for the traces from byteprofile☆32Nov 21, 2023Updated 2 years ago
- ☆224Oct 24, 2025Updated 4 months ago
- contains TLM2 based interfaces for AXI, ACE, CHI and other standard protocols☆63Updated this week
- HyFiSS: A Hybrid Fidelity Stall-Aware Simulator for GPGPUs☆39Dec 9, 2024Updated last year
- Artifacts for ATC '22 paper "Faster Software Packet Processing on FPGA NICs with eBPF Program Warping"☆17May 20, 2022Updated 3 years ago
- Repository for MLCommons Chakra schema and tools☆39Dec 24, 2023Updated 2 years ago
- ☆166Feb 22, 2024Updated 2 years ago
- ☆38Aug 7, 2025Updated 6 months ago
- LLMA = LLM + Arithmetic coder, which use LLM to do insane text data compression. LLMA=大模型+算术编码,它能使用LLM对文本数据进行暴力的压缩,达到极高的压缩率。☆22Nov 24, 2024Updated last year
- An HPL-AI implementation for Fugaku☆23Jun 29, 2021Updated 4 years ago
- corundum work on vu13p☆23Nov 10, 2023Updated 2 years ago
- This is simple code of SpikedAttention (Neurips 2024)☆23Mar 30, 2025Updated 11 months ago
- Performance Prediction Toolkit☆56Sep 13, 2025Updated 5 months ago
- C++ RTL simulator for EIE(https://arxiv.org/abs/1602.01528)☆23Mar 17, 2021Updated 4 years ago
- This is the open-source version of TinyTS. The code is dirty so far. We may clean the code in the future.☆19Aug 11, 2025Updated 6 months ago
- Linux on RISC-V on FPGA (LOROF): RV64GC Sv39 Quad-Core Superscalar Out-of-Order Virtual Memory CPU☆15Updated this week
- ☆52Jan 16, 2025Updated last year
- STONNE: A Simulation Tool for Neural Networks Engines☆147Jun 16, 2025Updated 8 months ago
- DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.☆93Jan 16, 2026Updated last month
- GPGPU-Sim 中文注释版代码,包含 GPGPU-Sim 模拟器的最新版代码,经过中文注释,以帮助中文用户更好地理解和使用该模拟器。☆28Dec 18, 2024Updated last year
- Pulp virtual platform☆24Jul 16, 2025Updated 7 months ago
- MAD (Model Automation and Dashboarding)☆31Updated this week
- Ongoing research training transformer models at scale☆37Feb 20, 2026Updated last week
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆32Jun 13, 2025Updated 8 months ago
- GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated…☆67Jan 22, 2026Updated last month