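🎓 A systematic course on building large language models | 🛠️ Covers pretraining data engineering, Tokenizer, Transformer, MoE, GPU programming (CUDA/Triton), distributed training, Scaling Laws, inference optimization, and alignment (SFT/RLHF/GRPO) | 🚀 6 progressive assignments, code-driven, building a full-stack understanding of LLMs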
☆812May 22, 2026Updated this week
Alternatives and similar repositories for diy-llm
Users that are interested in diy-llm are comparing it to the libraries listed below.
- A collection of assignments on large language models, serving for both beginners to get started and pros to practice advanced tech.☆46Mar 30, 2025Updated last year
- OpenClaw tutorial - build a cross-device AI assistant in one week☆166Mar 12, 2026Updated 2 months ago
- Official implementation of the paper "Embed Progressive Implicit Preference in Unified Space for Deep Collaborative Filtering"☆20Jun 22, 2025Updated 11 months ago
- Codes of Interpreting Low-level Vision Models with Causal Effect Maps☆35Sep 9, 2025Updated 8 months ago
- A flexible & scalable MLLM-based AIGC detection pipeline☆37Oct 27, 2025Updated 6 months ago
- Software Security, National Cybersecurity School, Wuhan University☆16Dec 9, 2024Updated last year
- [WWW'22] Deep Interest Highlight Network for Click-Through Rate Prediction in Trigger-Induced Recommendation☆22Apr 11, 2022Updated 4 years ago
- A reconstruction of RL Introduction and its course materials for a more efficient entry☆21Mar 4, 2026Updated 2 months ago
- 🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diver…☆184May 16, 2026Updated last week
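- A full-stack reference guide to large language models that uses minimal code to help you define, end to end, every detail from training a model from scratch to engineering deployment☆181Jan 15, 2026Updated 4 months ago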
- Sequential-Quadratic-Programming Derivative-Free Optimization☆17Dec 26, 2022Updated 3 years ago
- An Approximated Gradient Sign Method Using Differential Evolution For Black-box Adversarial Attack☆11Feb 25, 2022Updated 4 years ago
- ☆32Dec 14, 2025Updated 5 months ago
- code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)☆24Apr 26, 2025Updated last year
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆26May 27, 2025Updated 11 months ago
- Unified Audio-Visual Perception for Multi-Task Video Localization☆31Apr 19, 2024Updated 2 years ago
- KDD 2024 AQA competition 2nd place solution☆12Jul 21, 2024Updated last year
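- Operating systems practice wiki, School of Cybersecurity, Wuhan University☆19Nov 18, 2021Updated 4 years ago
- AI algorithm job-hunting guide (covering campus recruitment timelines, preparation guides for both experienced-hire and campus recruitment, coding practice guides, referrals and a list of AI companies, essential materials for algorithm job seekers, etc.); algorithm areas include machine learning, deep learning, computer vision, natural language processing, and search, advertising, and recommendation☆26Jun 15, 2024Updated last year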
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆12Aug 23, 2025Updated 9 months ago
- ☆12Sep 22, 2023Updated 2 years ago
- 📄 Tongji University Undergraduate Thesis Template | Typst☆40Dec 7, 2025Updated 5 months ago
- ☆11Nov 18, 2024Updated last year
- Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories".☆12Oct 25, 2022Updated 3 years ago
- [NeurIPS 2025 Spotlight] "Physics-Driven Spatiotemporal Modeling for AI-Generated Video Detection"☆82Nov 23, 2025Updated 6 months ago
- [CVPR 2024] Boosting Adversarial Transferability by Block Shuffle and Rotation☆14Feb 28, 2024Updated 2 years ago
- ☆12Dec 25, 2024Updated last year
- 🏆🏆 "Large models": All in one & All from scratch. 🌍🌍 Collect and clean data, train a Tokenizer, pretraining, SFT, GRPO!☆57Aug 12, 2025Updated 9 months ago
- CS 294-112 @ UCB Deep RL☆30Mar 24, 2023Updated 3 years ago
- Robust Adversarial Objects against Deep Learning Models☆11Mar 28, 2020Updated 6 years ago
- ☆15Dec 20, 2024Updated last year
- ☆13Feb 1, 2024Updated 2 years ago
- Code and data for PAN and PAN-phys.☆14Mar 20, 2023Updated 3 years ago
- CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors☆15Nov 3, 2024Updated last year
- This project solves linear-quadratic dynamic optimization (LQDO) problems using direct transcription (DT) and quadratic programming (QP)☆24Mar 29, 2025Updated last year
- ☆10Dec 10, 2023Updated 2 years ago
- Multi-Figurative Language Generation (COLING 2022)☆12Jan 30, 2023Updated 3 years ago
- ☆11Oct 8, 2022Updated 3 years ago
- 2nd Place Solution - Kaggle Challenge: Learning Equality - Curriculum Recommendations☆14Mar 28, 2023Updated 3 years ago