jychen21/Habana-LLM-Viewer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jychen21/Habana-LLM-Viewer)

jychen21 / Habana-LLM-Viewer

☆13

Alternatives and similar repositories for Habana-LLM-Viewer

Users that are interested in Habana-LLM-Viewer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LesleyLai / CUDA-flocking-boid
View on GitHub
☆14Dec 29, 2020Updated 5 years ago
UNITES-Lab / Occult
View on GitHub
[ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…
☆13Apr 17, 2025Updated last year
KiranThomasCherian / VLSI-and-Computer-Architecture
View on GitHub
Computer Architecture -VLSI -Verilog Codes-Xilinx-Irsim
☆14May 8, 2021Updated 5 years ago
GeeeekExplorer / kkbot
View on GitHub
A Feishu/Lark AI agent bot
☆15Feb 27, 2026Updated 4 months ago
vllm-project / vllm-nccl
View on GitHub
Manages vllm-nccl dependency
☆18Jun 3, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
xiatwhu / baidu_topk
View on GitHub
☆15Dec 1, 2023Updated 2 years ago
crab0314 / AudioPlayCache
View on GitHub
use ExoPlayer with AndroidVideoCache to play music, draw my own UI
☆10Apr 28, 2018Updated 8 years ago
mayerrn / two_phase_streaming
View on GitHub
☆11Mar 23, 2022Updated 4 years ago
lukedodd / JitCalc
View on GitHub
Mathematical expression evaluator with just in time code generation.
☆12Apr 7, 2013Updated 13 years ago
leftvalue / NeteaseApi
View on GitHub
网易云音乐 api(第三方)
☆13Sep 1, 2018Updated 7 years ago
estwings57 / HMC-MAC
View on GitHub
Processing-in Memory Architecture for Multiply-Accumulate Operations with Hybrid Memory Cube
☆12Feb 13, 2017Updated 9 years ago
PASSIONLab / distributed_sddmm
View on GitHub
Distributed SDDMM Kernel
☆12Jul 8, 2022Updated 4 years ago
PKUZHOU / GNNear-PACT-2022
View on GitHub
GNNear: Accelerating Full-Batch Training of Graph NeuralNetworks with Near-Memory Processing
☆17Sep 15, 2022Updated 3 years ago
iitm-sysdl / FuSeConv
View on GitHub
Code for paper "FuSeConv Fully Separable Convolutions for Fast Inference on Systolic Arrays" published at DATE 2021
☆18Aug 23, 2021Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
DataXujing / TensorRT-LLM-ChatGLM3
View on GitHub
大模型部署实战：TensorRT-LLM, Triton Inference Server, vLLM
☆27Feb 26, 2024Updated 2 years ago
StanfordVLSI / FP-Gen
View on GitHub
FPU Generator
☆20Jul 16, 2026Updated last week
Luca-Dalmasso / matrixTransposeCUDA
View on GitHub
CUDA C simple application for Nvidia's GPU
☆11Jun 7, 2022Updated 4 years ago
PhantomThief / jedis-helper
View on GitHub
☆12Mar 31, 2021Updated 5 years ago
adervay1 / CIMulator
View on GitHub
a Computing In Memory emULATOR framework
☆16May 19, 2024Updated 2 years ago
yiqiaowang / learning
View on GitHub
☆12Jun 3, 2019Updated 7 years ago
JiangLiSJTU / token-ring
View on GitHub
☆13Jan 7, 2025Updated last year
fw-ai / llama-cuda-graph-example
View on GitHub
Example of applying CUDA graphs to LLaMA-v2
☆11Aug 25, 2023Updated 2 years ago
madsys-dev / deepseekv2-profile
View on GitHub
☆156Mar 4, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
TaoLv / mxProfileParser
View on GitHub
A simple tool for parsing the profile.json file of mxnet
☆14Aug 1, 2018Updated 7 years ago
infinigence / FlashOverlap
View on GitHub
A lightweight design for computation-communication overlap.
☆242Jan 20, 2026Updated 6 months ago
NVIDIA / nvbench_demo
View on GitHub
Simple starter CMake project that uses NVBench.
☆15May 6, 2025Updated last year
mayerrn / hybrid_edge_partitioner
View on GitHub
☆19Sep 17, 2021Updated 4 years ago
FanosResearch / OpenDRAM
View on GitHub
☆21Jul 16, 2026Updated last week
coolceph / bhook
View on GitHub
Baidu Hook
☆13Jan 7, 2016Updated 10 years ago
platformxlab / LeaFTL
View on GitHub
☆26Jan 10, 2023Updated 3 years ago
7bvcxz / PIMsim
View on GitHub
☆20Jun 1, 2023Updated 3 years ago
Leo9660 / HedraRAG_AE
View on GitHub
Artifact Evaluation for SOSP 2025
☆21Aug 16, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
CMU-SAFARI / Cache-Memory-Hog
View on GitHub
Cache and main memory hog programs. These are programs with specific access patterns to evict the already existing cache blocks of variou…
☆19Nov 2, 2016Updated 9 years ago
xinhao-luo / ClusterFusion
View on GitHub
[NeurIPS 2025] ClusterFusion: Expanding Operator Fusion Scope for LLM Inference via Cluster-Level Collective Primitive
☆75Dec 11, 2025Updated 7 months ago
jhson989 / cuda-ptx
View on GitHub
Inline PTX Assembly in CUDA example
☆15May 7, 2022Updated 4 years ago
daochenzha / neuroshard
View on GitHub
[MLSys 2023] Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
☆16May 5, 2023Updated 3 years ago
thu-nics / UniNDP
View on GitHub
Github repository of HPCA 2025 paper "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"
☆23Jan 18, 2026Updated 6 months ago
intel / flexmalloc
View on GitHub
Flexible memory allocation tool for multi-tiered memory systems
☆15Jan 7, 2026Updated 6 months ago
johnhany / awesome-list
View on GitHub
A list of useful stuff in Machine Learning, Computer Graphics, Software Development, ...
☆18Nov 14, 2022Updated 3 years ago