scai-tech/NeuSight

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/scai-tech/NeuSight)

scai-tech / NeuSight

☆83

Alternatives and similar repositories for NeuSight

Users that are interested in NeuSight are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

astra-sim / libra
View on GitHub
LIBRA: Enabling Workload-aware Multi-dimensional Network Topology Optimization for Distributed Training of Large AI Models
☆12May 7, 2024Updated 2 years ago
yuezuegu / sosa-compiler
View on GitHub
Repository for compilation and cycle-accurate simulator for scale-out systolic arrays
☆16Jan 4, 2023Updated 3 years ago
astra-sim / astra-sim
View on GitHub
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
☆649Apr 25, 2026Updated 3 months ago
eth-easl / gpu-util-interference
View on GitHub
CUDA benchmarks for measuring GPU utilization and interference
☆18Feb 11, 2025Updated last year
microsoft / vidur
View on GitHub
Accurate, large-scale, and extensible simulator for LLM inference Systems
☆648Jul 25, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pku-lemonade / TokenSim
View on GitHub
TokenSim is a tool for simulating the behavior of large language models (LLMs) in a distributed environment.
☆27Jun 26, 2026Updated last month
alibaba / hap
View on GitHub
☆16Apr 13, 2024Updated 2 years ago
muriloboratto / NVSHEMEM
View on GitHub
Sample Codes using NVSHMEM on Multi-GPU
☆30Jan 22, 2023Updated 3 years ago
casys-kaist / LLMServingSim
View on GitHub
LLMServingSim 2.0: A Unified Simulator for Heterogeneous and Disaggregated LLM Serving Infrastructure
☆347Updated this week
sjtu-epcc / Tacker
View on GitHub
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
☆33Feb 10, 2025Updated last year
yonsei-hpcp / gcom
View on GitHub
☆15May 8, 2025Updated last year
duowuyms / OpenCATP-LLM
View on GitHub
The official repository of ICCV 2025 paper "CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning".
☆21Nov 26, 2025Updated 8 months ago
awslabs / optimizing-multitask-training-through-dynamic-pipelines
View on GitHub
Official repository for the paper DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines
☆19Dec 8, 2023Updated 2 years ago
pyxis-roc / ptxparser
View on GitHub
A parser for PTX 6.5
☆13Jun 19, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
pku-liang / Hestia
View on GitHub
☆17Mar 26, 2025Updated last year
guqiqi / Samoyeds
View on GitHub
Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores (EuroSys'25)
☆16Jul 17, 2025Updated last year
intel / CacheLib
View on GitHub
Pluggable in-process caching engine to build and scale high performance services
☆19Jul 21, 2026Updated last week
eth-easl / orion
View on GitHub
An interference-aware scheduler for fine-grained GPU sharing
☆164Nov 26, 2025Updated 8 months ago
abhibambhaniya / GenZ-LLM-Analyzer
View on GitHub
LLM Inference analyzer for different hardware platforms
☆121Jun 23, 2026Updated last month
VerticalResearchGroup / Gauntlet
View on GitHub
Research Gauntlet Virtual Brainstorming
☆37Feb 14, 2026Updated 5 months ago
msr-fiddle / blox
View on GitHub
☆47Jul 4, 2024Updated 2 years ago
NMSU-PEARL / GPUs-Energy
View on GitHub
[CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs
☆15Dec 11, 2020Updated 5 years ago
sunlex0717 / DissectingTensorCores
View on GitHub
☆115Apr 19, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sarchlab / triosim
View on GitHub
☆42Jul 2, 2026Updated 3 weeks ago
open-neutrino / neutrino
View on GitHub
☆264Dec 25, 2025Updated 7 months ago
ShihengCao / AMALi
View on GitHub
A GPU analytical model for LLM inference [ISCA 25]
☆20May 4, 2026Updated 2 months ago
aliyun / SimAI
View on GitHub
☆1,035Apr 24, 2026Updated 3 months ago
turbo0628 / Taichi-MPI
View on GitHub
The Taichi MPI demos with MPI4Py
☆13Nov 3, 2022Updated 3 years ago
PrincetonUniversity / ttm-cas
View on GitHub
☆16Oct 15, 2023Updated 2 years ago
gcoe-dresden / cuda-gpu-tlb
View on GitHub
TLB Benchmarks
☆35Sep 11, 2017Updated 8 years ago
microsoft / sarathi-serve
View on GitHub
A low-latency & high-throughput serving engine for LLMs
☆511Jan 8, 2026Updated 6 months ago
LLMServe / DistServe
View on GitHub
Disaggregated serving system for Large Language Models (LLMs).
☆826Apr 6, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ritikraj7 / cpu-centric-agentic-ai
View on GitHub
A comprehensive benchmarking framework for evaluating and optimizing CPU-centric agentic AI systems across multiple workloads, reproducin…
☆49Feb 12, 2026Updated 5 months ago
zhang677 / PCL-lite
View on GitHub
[ICML 2025] Adaptive Self-improvement LLM Agentic System for ML Library Development
☆17Jan 6, 2026Updated 6 months ago
accel-sim / accel-sim-framework
View on GitHub
This is the top-level repository for the Accel-Sim framework.
☆631Mar 24, 2026Updated 4 months ago
spcl / atlahs
View on GitHub
ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage
☆97May 12, 2026Updated 2 months ago
ml-energy / zeus
View on GitHub
Measure and optimize the energy consumption of your AI applications!
☆371Jul 7, 2026Updated 3 weeks ago
arjundevraj / stragglar
View on GitHub
☆15Oct 2, 2025Updated 9 months ago
hahnyuan / LLM-Viewer
View on GitHub
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline mod…
☆665Sep 11, 2024Updated last year