chenzomi12 / DeepLearningSystem
AI Infra主要是指AI的基础建设,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术。
☆157Updated 5 months ago
Related projects: ⓘ
- ☆572Updated last month
- A LLM semantic caching system aiming to enhance user experience by reducing response time via cached query-result pairs.☆766Updated last week
- 金融财报问答大模型LLM☆190Updated 6 months ago
- AIFoundation 主要是指AI系统遇到大模型,从底层到上层如何系统级地支持大模型训练和推理,全栈的核心技术。☆178Updated this week
- LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)☆350Updated 11 months ago
- bert4torch底层训练框架,用keras风格写torch代码☆73Updated last week
- LLM101n: Let's build a Storyteller 中文版☆113Updated last month
- Tutorials for writing high-performance GPU operators in AI frameworks.☆118Updated last year
- learning how CUDA works☆150Updated last month
- ☆284Updated 2 months ago
- ☆159Updated 3 weeks ago
- ☆251Updated last week
- 友谊魔兽 wow game server http://kkwww.com☆140Updated 3 months ago
- flash attention tutorial written in python, triton, cuda, cutlass☆159Updated 3 months ago
- A toolbox for deep learning model deployment using C++ YoloX | YoloV7 | YoloV8 | Gan | OCR | MobileVit | Scrfd | MobileSAM | StableDiffus…☆443Updated 3 months ago
- 校招、秋招、春招、实习好项目,带你从零动手实现支持LLama的大模型推理框架。☆170Updated this week
- Toolkit for Prompt Compression☆240Updated 5 months ago
- 中文版 llm-numbers☆94Updated 8 months ago
- Go Out出海第一步,搞定工具库 独立开发者出海技术栈和工具, 收集的一些有用的出海工具和资源,可以帮助你更好地了解和进入海外市场。 挑选标准 帮助独立开发者提升开发效率 帮助独立开发者降低成本 市场上足够流行☆19Updated 3 months ago
- C++企业微信消息推送服务器 (可接入ChatGPT)☆37Updated last year
- ☆56Updated last week
- 模型压缩的小白入门教程☆135Updated this week
- 🔥 基于 Vue3 + Unocss + Naive UI 的轻量简洁的后台管理模板☆134Updated 9 months ago
- ☆11Updated 6 months ago
- This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit…☆227Updated this week
- FlagGems is an operator library for large language models implemented in Triton Language.☆246Updated last week
- HugNLP is a unified and comprehensive NLP library based on HuggingFace Transformer. Please hugging for NLP now!😊 HugNLP will released to…☆250Updated last year
- LLM/MLOps/LLMOps☆44Updated last week
- 看图学大模型☆148Updated last month
- A collection of memory efficient attention operators implemented in the Triton language.☆205Updated 3 months ago