openmlsys / openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
☆4,067Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for openmlsys-zh
- System for AI Education Resource.☆3,585Updated 2 weeks ago
- 校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library st…☆2,517Updated 2 weeks ago
- compiler learning resources collect.☆2,138Updated 5 months ago
- how to optimize some algorithm in cuda.☆1,569Updated this week
- ☆588Updated 5 months ago
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,332Updated 3 years ago
- 🎉 Modern CUDA Learn Notes with PyTorch: CUDA Cores, Tensor Cores, fp32/tf32, fp16/bf16, fp8/int8, flash_attn, rope, sgemm, hgemm, sgemv,…☆1,384Updated this week
- 强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/☆9,467Updated this week
- AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术☆11,095Updated 2 weeks ago
- ☆2,187Updated 9 months ago
- 🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Mod…☆2,685Updated 2 months ago
- OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.☆5,903Updated this week
- This is a Chinese translation of the CUDA programming guide☆1,260Updated last year
- 推荐系统入门教程,在线阅读地址:https://datawhalechina.github.io/fun-rec/☆4,483Updated 4 months ago
- real Transformer TeraFLOPS on various GPUs☆873Updated 9 months ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆815Updated 2 months ago
- The road to hack SysML and become an system expert☆432Updated last month
- AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目☆1,706Updated 3 weeks ago
- A simple deep learning framework in pure python for purpose of learning in DL☆428Updated 2 years ago
- A primitive library for neural network☆1,291Updated this week
- 高性能并行编程与优化 - 课件☆3,737Updated 3 weeks ago
- ☆2,425Updated 9 months ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题☆3,536Updated 2 weeks ago
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,548Updated 7 months ago
- Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.☆3,087Updated this week
- hpc-learning☆585Updated 5 months ago
- 深度学习经典、新论文逐段精读☆26,994Updated 3 months ago
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,198Updated last year
- 高性能计算相关知识学习笔记,包含学习笔记和相关知识的代码demo,在持续完善中。 如果有帮助的话请Star一下,对作者帮助很大,谢谢!☆370Updated last year
- 《机器学习理论导引》(宝箱书)的证明、案例、概念补充与参考文献讲解。☆1,554Updated this week