chenzomi12 / DeepLearningSystemLinks
AI Infra主要是指AI的基础建设,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术。
☆238Updated last year
Alternatives and similar repositories for DeepLearningSystem
Users that are interested in DeepLearningSystem are comparing it to the libraries listed below
Sorting:
- ☆611Updated 10 months ago
- llm theoretical performance analysis tools and support params, flops, memory and latency analysis.☆96Updated last week
- A light llama-like llm inference framework based on the triton kernel.☆128Updated last week
- Triton Documentation in Chinese Simplified / Triton 中文文档☆71Updated 2 months ago
- UltraScale Playbook 中文版☆43Updated 3 months ago
- 模型压缩的小白入门教程,PDF下载地址 https://github.com/datawhalechina/awesome-compression/releases☆297Updated last week
- [ICCV 2023] RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers☆128Updated last year
- Accelerate inference without tears☆318Updated 3 months ago
- 使用 CUDA C++ 实现的 llama 模型推理框架☆57Updated 7 months ago
- Compare different hardware platforms via the Roofline Model for LLM inference tasks.☆100Updated last year
- ☆168Updated this week
- 校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。☆369Updated this week
- Summary of the Specs of Commonly Used GPUs for Training and Inference of LLM☆46Updated 3 months ago
- A tutorial for CUDA&PyTorch☆146Updated 5 months ago
- Tutorials for writing high-performance GPU operators in AI frameworks.☆130Updated last year
- A pupil in the computer world.(Felix Fu)☆238Updated last year
- how to learn PyTorch and OneFlow☆435Updated last year
- ☆336Updated this week
- LLM/MLOps/LLMOps☆92Updated last month
- my cs notes☆51Updated 8 months ago
- FlagScale is a large model toolkit based on open-sourced projects.☆307Updated this week
- ☆51Updated last week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆256Updated 3 weeks ago
- RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions☆138Updated 2 months ago
- ☆28Updated last month
- ☆79Updated last year
- LLM全栈优质资源汇总☆572Updated 7 months ago
- Inference code for LLaMA models☆121Updated last year
- ☆133Updated 4 months ago
- ☆45Updated last year