Ascend / ascend-docker-image
☆22Updated last year
Alternatives and similar repositories for ascend-docker-image:
Users who are interested in ascend-docker-image are comparing it to the repositories listed below
- Compare different hardware platforms via the Roofline Model for LLM inference tasks.☆78Updated 9 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆139Updated this week
- ☆75Updated last year
- A CLI for Kubeflow.☆59Updated 11 months ago
- ☆126Updated 3 weeks ago
- ☆150Updated last week
- ☆45Updated 9 months ago
- ☆56Updated 4 years ago
- ☆156Updated this week
- Tools for monitoring NVIDIA GPUs on Linux☆9Updated 4 years ago
- ☆69Updated last year
- ☆101Updated 8 months ago
- Transformer related optimization, including BERT, GPT☆17Updated last year
- export llama to onnx☆103Updated 6 months ago
- ☆295Updated last week
- ☆21Updated last year
- OpenAIOS is an incubating open-source distributed OS kernel based on Kubernetes for AI workloads. OpenAIOS-Platform is an AI development…☆95Updated 3 years ago
- FastNN provides distributed training examples that use EPL.☆83Updated 2 years ago
- ☆210Updated last year
- FlagScale is a large model toolkit based on open-sourced projects.☆195Updated this week
- Run ChatGLM2-6B on BM1684X☆48Updated 9 months ago
- ☆23Updated last year
- ☆140Updated 7 months ago
- alibabacloud-aiacc-demo☆42Updated last year
- LLM Inference benchmark☆360Updated 4 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆556Updated 2 months ago
- A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod☆121Updated 2 years ago
- ☆116Updated last month
- Device plugin for Volcano vGPU that supports hard resource isolation☆54Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆126Updated last week