🎓 系统性大语言模型构建课程|🛠️ 覆盖预训练数据工程、Tokenizer、Transformer、MoE、GPU 编程 (CUDA/Triton)、分布式训练、Scaling Laws、推理优化及对齐 (SFT/RLHF/GRPO)|🚀 6 个渐进式作业 + 代码驱动,建立 LLM 全栈认知体系
☆249Mar 28, 2026Updated this week
Alternatives and similar repositories for diy-llm
Users that are interested in diy-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenClaw 学习教程 - 一周打造跨设备 AI 助手☆89Mar 12, 2026Updated 2 weeks ago
- 武汉大学国家网络安全学院2021级操作系统期末大实验☆12Jan 2, 2024Updated 2 years ago
- Project mangement summary set. 项目管理经验总结。 针对嵌入式、软件开发相关的技术管理 TM、项目管理 PM、配置管理(CM)、知识管理 KM、敏捷开发APM、DevSecOps 实践、认证-标准(ISO26262)、软件全生命周期管理(需求 …☆14Dec 11, 2022Updated 3 years ago
- An reconstruction of RL Introduction and its course materials for a more efficient entry☆21Mar 4, 2026Updated 3 weeks ago
- Datawhale开源教程 Bishop 深度学习理论和方法讲解☆36Jan 8, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 从 NLP 到 LLM 的算法全栈教程,在线阅读地址:https://datawhalechina.github.io/base-llm/☆514Updated this week
- 学习他人如何制作漂亮的notebook。「Java学习+面试指南」一份涵盖大部分 Java 程序员所需要掌握的核心知识。☆10Sep 24, 2021Updated 4 years ago
- KDD 2024 AQA competition 2nd place solution☆12Jul 21, 2024Updated last year
- AI算法岗求职攻略(涵盖校招时间表、准备攻略(社招和校招)、刷题指南、内推和 AI 公司清单、求职算法必备资料等),算法方向涉及:机器学习、深度学习、计算机视觉、自然语言处理和搜广推等☆25Jun 15, 2024Updated last year
- Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories".☆11Oct 25, 2022Updated 3 years ago
- 🏆🏆 「大模型」All in one & All from scratch. 🌍🌍 收集、清洗数据,训练Tokenizer,预训练、SFT、GRPO!☆55Aug 12, 2025Updated 7 months ago
- 针对NEU-DET数据集的钢材缺陷检测☆30Jun 23, 2023Updated 2 years ago
- DataFountain第五届达观杯第4名方案☆11Dec 3, 2021Updated 4 years ago
- Multi-Figurative Language Generation (COLING 2022)☆12Jan 30, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 基于CNN的图像验证码识别,单个验证码识别成功率99%☆27Oct 3, 2023Updated 2 years ago
- 文言文信息抽取(实体识别+关系抽取)☆10Feb 24, 2023Updated 3 years ago
- An implementation of RaFM. Xiaoshuang Chen, Yin Zheng, Jiaxiang Wang, et al. "RaFM: Rank-Aware Factorization Machines"☆12May 11, 2019Updated 6 years ago
- ☆34Jan 4, 2023Updated 3 years ago
- ☆10Jan 5, 2021Updated 5 years ago
- Implementation of Variational Auto-Encoder for text generation in pytorch.☆12Oct 9, 2020Updated 5 years ago
- ☆92Jul 24, 2025Updated 8 months ago
- Official code for the CVPR 2020 paper "Shoestring: Graph-Based Semi-Supervised Classification with Severely Limited Labeled Data."☆10Sep 23, 2020Updated 5 years ago
- 科大讯飞多模态RAG图文问答挑战赛☆62Aug 4, 2025Updated 7 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A simple middleware to improving GPU utilization then speedup online inference.☆19Feb 22, 2021Updated 5 years ago
- ☆14Dec 26, 2022Updated 3 years ago
- ☆20Nov 6, 2023Updated 2 years ago
- A useful toolbox for research.☆33Jul 10, 2024Updated last year
- Amazon Recommendation System build on BPR TensorFlow implementation☆16Jun 10, 2017Updated 8 years ago
- Sketch Driven Regular Expression Generation.☆16Apr 26, 2023Updated 2 years ago
- 根据正则表达式生成其对应 DFA 的状态转移图☆15Nov 20, 2018Updated 7 years ago
- ☆21Aug 16, 2023Updated 2 years ago
- Our solutions for KDDCup2019 as team "admin"☆14Jul 20, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 2018年中国机器人技能大赛Nao机器人舞蹈(高校组)季军☆13Jan 20, 2020Updated 6 years ago
- A comparison of pretraining framework for LLM☆22Feb 6, 2025Updated last year
- Character-based seq2seq models (english => predicate logic)☆16Dec 26, 2020Updated 5 years ago
- ☆18Nov 19, 2017Updated 8 years ago
- OneSug☆26Nov 13, 2025Updated 4 months ago
- Code for KE-Blender, EMNLP 2021☆18Mar 1, 2022Updated 4 years ago
- The source code of Paper "PathQG: Neural Question Generation from Facts".☆23Jan 4, 2021Updated 5 years ago