FlyAIBox / dcu-in-actionLinks

国产加速卡-海光DCU实战（大模型训练、微调、推理等）

☆50

Alternatives and similar repositories for dcu-in-action

Users that are interested in dcu-in-action are comparing it to the libraries listed below

Sorting:

OpenCSGs / llm-inference
llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…
☆86Updated last year
vllm-project / vllm-nccl
Manages vllm-nccl dependency
☆17Updated last year
DeepLink-org / DeepTrace
DeepTrace: A lightweight, scalable real-time diagnostic and analysis tool for distributed training tasks.
☆15Updated last month
HFAiLab / hai-platform-studio
配合 HAI Platform 使用的集成化用户界面
☆53Updated 2 years ago
zhaochenyang20 / ModelServer
Efficient, Flexible, and Highly Fault-Tolerant Model Service Management Based on SGLang
☆58Updated 11 months ago
hellangleZ / Qwen3_autothink_adapter
Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…
☆22Updated 5 months ago
thunlp / Delta-CoMe
Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024
☆57Updated 11 months ago
OpenBMB / CPM.cu
CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for end-device inference and featuring cutting-edge tec…
☆198Updated last week
OpenCSGs / llm-finetune
The framework of training large language models，support lora, full parameters fine tune etc, define yaml to start training/fine tune of y…
☆30Updated last year
Tencent / AngelSlim
Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.
☆178Updated this week
tile-ai / tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆19Updated this week
SkyworkAI / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆16Updated last year
Rayrtfr / FasterTransformer
Transformer related optimization, including BERT, GPT
☆17Updated 2 years ago
Laoyu84 / 4onebench
A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.
☆26Updated 10 months ago
sgl-project / tensorrt-demo
TensorRT LLM Benchmark Configuration
☆13Updated last year
bentoml / llm-bench
☆56Updated 11 months ago
ArtificialZeng / transformers-Explained
官方transformers源码解析。AI大模型时代，pytorch、transformer是新操作系统，其他都是运行在其上面的软件。
☆17Updated 2 years ago
tensorchord / deepseek-api-arena
A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.
☆29Updated 6 months ago
reilxlx / llava-Qwen2-7B-Instruct-Chinese-CLIP
模型 llava-Qwen2-7B-Instruct-Chinese-CLIP 增强中文文字识别能力和表情包内涵识别能力，接近gpt4o、claude-3.5-sonnet的识别水平！
☆25Updated last year
pandada8 / llm-inference-benchmark
LLM 推理服务性能测试
☆43Updated last year
OpenBMB / cpm_kernels
☆25Updated 2 years ago
LLM-inference-router / vllm-router
vLLM Router
☆44Updated last year
limafang / Xtuner-GUI
Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…
☆13Updated last year
opendatahub-io / vllm-tgis-adapter
vLLM adapter for a TGIS-compatible gRPC server.
☆41Updated this week
WALLE-AI / uReasoningLLMs
Deepseek-r1复现科普与资源汇总
☆22Updated 7 months ago
MooreThreads / TurboRAG
☆82Updated 10 months ago
modelscope / dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …
☆266Updated 2 months ago
zai-org / GLM-Edge
GLM Series Edge Models
☆149Updated 4 months ago
woct0rdho / transformers-qwen3-moe-fused
Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth
☆191Updated last week
01-ai / Descartes
☆112Updated last year