Ascend / ascend-docker-image

☆22

Alternatives and similar repositories for ascend-docker-image:

Users that are interested in ascend-docker-image are comparing it to the libraries listed below

Ascend / AscendSpeed
☆76Updated last year
feifeibear / LLMRoofline
Compare different hardware platforms via the Roofline Model for LLM inference tasks.
☆93Updated 11 months ago
modelscope / dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …
☆231Updated last week
OpenPPL / ppl.llm.serving
☆127Updated last month
OpenBMB / cpm_kernels
☆23Updated last year
Rayrtfr / FasterTransformer
Transformer related optimization, including BERT, GPT
☆17Updated last year
volcengine / veGiantModel
☆213Updated last year
mindspore-lab / mindpet
☆45Updated 11 months ago
FlagOpen / FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
☆223Updated this week
second-state / meetups
☆69Updated last month
AliyunContainerService / arena
A CLI for Kubeflow.
☆59Updated last year
virtaitech / orion
☆273Updated last year
mindspore-lab / mindformers
☆153Updated this week
alibaba / EasyParallelLibrary
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
☆266Updated last year
Rayrtfr / fastertransformer_backend
☆9Updated last year
zw0610 / zw0610.github.io
☆58Updated 4 years ago
01-ai / Descartes
☆107Updated 10 months ago
alibaba / ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
☆304Updated last week
hyperai / triton-cn
Triton Documentation in Chinese Simplified / Triton 中文文档
☆54Updated last month
DeepLink-org / dlinfer
☆40Updated this week
HFAiLab / hai-platform-studio
配合 HAI Platform 使用的集成化用户界面
☆35Updated last year
THUDM / FasterTransformer
Transformer related optimization, including BERT, GPT
☆39Updated 2 years ago
zms1999 / SmartMoE
A MoE impl for PyTorch, [ATC'23] SmartMoE
☆61Updated last year
OpenPPL / ppl.nn.llm
☆140Updated 9 months ago
sophgo / ChatGLM2-TPU
run ChatGLM2-6B in BM1684X
☆49Updated 11 months ago
madsys-dev / deepseekv2-profile
☆104Updated 6 months ago
void-main / fastertransformer_backend
☆21Updated last year
ninehills / llm-inference-benchmark
LLM Inference benchmark
☆394Updated 6 months ago
ForceInjection / AI-fundermentals
AI 基础知识 - GPU 架构、CUDA 编程以及大模型基础知识
☆70Updated this week