InfiniTensor / InfiniLM-RustLinks
☆125Updated last week
Alternatives and similar repositories for InfiniLM-Rust
Users that are interested in InfiniLM-Rust are comparing it to the libraries listed below
Sorting:
- ☆62Updated 10 months ago
- 算子库☆16Updated 2 months ago
- ☆257Updated this week
- easy cuda code☆83Updated 9 months ago
- 笔记☆43Updated last month
- 算子库(Rust)☆14Updated 2 months ago
- ☆36Updated 8 months ago
- ☆65Updated 8 months ago
- 实验:rust 实现 llama2 推理☆16Updated last year
- ☆27Updated 2 weeks ago
- A domain-specific language (DSL) based on Triton but providing higher-level abstractions.☆30Updated this week
- A PyTorch-like deep learning framework. Just for fun.☆155Updated last year
- 分层解耦的深度学习推理引擎☆75Updated 7 months ago
- 《自己动手写AI编译器》☆28Updated 11 months ago
- Wiki fo HPC☆121Updated 2 months ago
- Codes & examples for "CUDA - From Correctness to Performance"☆111Updated 11 months ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆84Updated 5 months ago
- Fast OS-level support for GPU checkpoint and restore☆238Updated this week
- ☆48Updated last year
- CUDA SGEMM optimization note☆13Updated last year
- LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model☆46Updated last month
- ☆13Updated last year
- ☆70Updated 2 years ago
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.☆111Updated 4 months ago
- gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling☆41Updated this week
- 训练营讲义☆19Updated 8 months ago
- 没分支的 rCore-Tutorial☆46Updated 3 weeks ago
- ☆42Updated last year
- Assignments of Stanford CS110L-2020spr: Safety in Systems Programming☆53Updated 2 years ago
- ☆190Updated last month