☆50Oct 14, 2025Updated 7 months ago
Alternatives and similar repositories for LLMSimulator
Users that are interested in LLMSimulator are comparing it to the libraries listed below.
- ☆22Feb 26, 2023Updated 3 years ago
- ☆15Apr 18, 2024Updated 2 years ago
- ☆155Jun 24, 2024Updated last year
- ☆116Jul 4, 2024Updated last year
- ☆12Jul 2, 2024Updated last year
- Simulator code of the paper "Dissecting and Modeling the Architecture of Modern GPU Cores"☆92Oct 15, 2025Updated 7 months ago
- This is where gem5 based DRAM cache models live.☆20Mar 23, 2023Updated 3 years ago
- ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference☆205Jan 8, 2026Updated 5 months ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆138May 3, 2025Updated last year
- [ASP-DAC 2025] "NeuronQuant: Accurate and Efficient Post-Training Quantization for Spiking Neural Networks" Official Implementation☆19Mar 6, 2025Updated last year
- PALM: An Efficient Performance Simulator for Tiled Accelerators with Large-scale Model Training☆22Jun 12, 2024Updated last year
- Processing-In-Memory (PIM) Simulator☆238Dec 12, 2024Updated last year
- Anatomy of a powerhouse: SystemVerilog TPU based on Google TPU v1☆22Nov 9, 2025Updated 7 months ago
- Cheddar: A Swift Fully Homomorphic Encryption (FHE) GPU Library☆86Apr 9, 2026Updated 2 months ago
- A Cycle-level simulator for M2NDP☆37Aug 14, 2025Updated 9 months ago
- RISC-V Superscalar Educational Simulator based on Tomasulo's Algorithm☆32Nov 1, 2025Updated 7 months ago
- [HPCA 2026 Best Paper Candidate] Official implementation of "Focus: A Streaming Concentration Architecture for Efficient Vision-Language …☆54Feb 8, 2026Updated 4 months ago
- Draft-Target Disaggregation LLM Serving System via Parallel Speculative Decoding.☆206Mar 18, 2026Updated 2 months ago
- Extending PyTorch to Fully Homomorphic Encryption☆121May 21, 2026Updated 3 weeks ago
- Here are some implementations of basic hardware units in RTL language (verilog for now), which can be used for area/power evaluation and …☆14Aug 25, 2023Updated 2 years ago
- Computational Memory Neural Network Compiler☆11Aug 11, 2021Updated 4 years ago
- The official implementation of HPCA 2025 paper, Prosperity: Accelerating Spiking Neural Networks via Product Sparsity☆39Aug 9, 2025Updated 10 months ago
- ☆30Feb 27, 2025Updated last year
- ☆13Jul 3, 2020Updated 5 years ago
- ☆59Apr 9, 2026Updated 2 months ago
- WaferLLM: Large Language Model Inference at Wafer Scale☆108Apr 4, 2026Updated 2 months ago
- ☆42May 19, 2026Updated 3 weeks ago
- Codes for our paper "Exploring Bit-Slice Sparsity in Deep Neural Networks for Efficient ReRAM-Based Deployment" [NeurIPS'19 EMC2 workshop]…☆10Oct 12, 2020Updated 5 years ago
- ☆14Oct 11, 2024Updated last year
- An implementation of RV32I based on EECS151☆10Jan 30, 2024Updated 2 years ago
- A backend agnostic modular FHE library over the Torus using bivariate polynomial representation☆68Updated this week
- An analytical framework that models hardware dataflow of tensor applications on spatial architectures using the relation-centric notation…☆88Apr 28, 2024Updated 2 years ago
- A Parallel Simulation Framework For Multicore Systems☆11May 20, 2017Updated 9 years ago
- ☆12Jan 13, 2023Updated 3 years ago
- A paper review list for computer architecture and systems research, maintained by the LEMONADE group at Peking University.☆20Updated this week
- Lab for Digital Design and Computer Architecture Spring 2022 (252-0028-00L) (ETH).☆14Mar 1, 2023Updated 3 years ago
- A deep learning intermediate representation for multi-platform compilation optimization☆10Oct 28, 2024Updated last year
- A synthesis flow for hybrid processing-in-RRAM modes☆12Jul 15, 2021Updated 4 years ago
- H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference☆108Apr 26, 2025Updated last year