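🎓 A systematic course on building large language models | 🛠️ Covers pretraining data engineering, tokenizers, Transformers, MoE, GPU programming (CUDA/Triton), distributed training, scaling laws, inference optimization, and alignment (SFT/RLHF/GRPO) | 🚀 Six progressive, code-driven assignments that build a full-stack understanding of LLMs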
☆880Jun 6, 2026Updated last week
Alternatives and similar repositories for diy-llm
Users who are interested in diy-llm are comparing it to the libraries listed below.
- 📚 A learning tutorial for "langent", a name formed by merging "lang" and "agent"☆340Jun 5, 2026Updated last week
- Repo for CS 259D: Data Mining for Cyber Security☆18Dec 12, 2014Updated 11 years ago
- WIPE implementation☆13Nov 26, 2023Updated 2 years ago
- official code of Efficient Depth-Guided Urban View Synthesis☆14Dec 24, 2024Updated last year
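- This series takes readers from zero background to an understanding of where LangChain V1.x and its ecosystem (including LangGraph, DeepAgents, and others) fit in today's mainstream agent development and what their core capabilities are. It covers, in order, basic environment setup, dependency installation, introductory LLM calls, hands-on prompt engineering, Agent…☆89May 17, 2026Updated 3 weeks ago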
- Official implementation of the paper "Embed Progressive Implicit Preference in Unified Space for Deep Collaborative Filtering"☆20Jun 22, 2025Updated 11 months ago
- ☆15Apr 8, 2025Updated last year
- ☆12Nov 28, 2022Updated 3 years ago
- Software security course materials, National Cybersecurity School, Wuhan University☆16Dec 9, 2024Updated last year
- [WWW'22] Deep Interest Highlight Network for Click-Through Rate Prediction in Trigger-Induced Recommendation☆22Apr 11, 2022Updated 4 years ago
- A lightweight cross-platform prompt manager for researchers to organize, reuse, and iterate high-quality prompts.☆46Apr 12, 2026Updated 2 months ago
- A reconstruction of RL Introduction and its course materials for a more efficient entry☆21Mar 4, 2026Updated 3 months ago
- Computer networking labs, Huazhong University of Science and Technology, 2019 cohort☆12Oct 24, 2022Updated 3 years ago
- 🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diver…☆212May 19, 2026Updated 3 weeks ago
- Notes on learning reinforcement learning through animations☆67Apr 23, 2026Updated last month
- Sequential-Quadratic-Programming Derivative-Free Optimization☆17Dec 26, 2022Updated 3 years ago
- ☆17Dec 20, 2023Updated 2 years ago
- [AAAI 2025] Official code for paper: DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image S…☆20Jun 16, 2025Updated 11 months ago
- An Approximated Gradient Sign Method Using Differential Evolution For Black-box Adversarial Attack☆11Feb 25, 2022Updated 4 years ago
- The source code of [WWW 2025] MoDiCF☆14Mar 26, 2026Updated 2 months ago
- [NeurIPS '21] Adversarial Attacks on Graph Classification via Bayesian Optimisation (GRABNEL)☆15Nov 21, 2021Updated 4 years ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆26May 27, 2025Updated last year
- [CVPR 2025] Official implementation for the paper "Towards Understanding How Knowledge Evolves in Large Vision-Language Models"☆32Apr 10, 2025Updated last year
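- A Chinese edition of Stanford Town: deployed with local models, with prompt engineering localized into Chinese and a simplified workflow☆58Oct 16, 2025Updated 7 months ago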
- PyTorch Implementation for Neural Graph Collaborative Filtering☆32Jul 6, 2023Updated 2 years ago
- Multiscale Facial Expression Recognition Based on Dynamic Global and Static Local Attention, in IEEE Transactions on Affective Computing …☆15Jun 3, 2025Updated last year
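- A job-hunting guide for AI algorithm roles (covering the campus recruitment timeline, preparation guides for both experienced-hire and campus recruitment, a coding practice guide, referrals and a list of AI companies, essential materials for algorithm job seekers, and more). Algorithm areas covered include machine learning, deep learning, computer vision, natural language processing, and search, advertising, and recommendation☆26Jun 15, 2024Updated 2 years ago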
- ☆13Jan 31, 2023Updated 3 years ago
- Code for ACL 2023 Oral Paper: ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning☆12Aug 23, 2025Updated 9 months ago
- ☆11Nov 18, 2024Updated last year
- Benchmark dataset for the paper "Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with …☆27May 20, 2025Updated last year
- Is Neuron Coverage a Meaningful Measure for Testing Deep Neural Networks? (FSE 2020)☆10Sep 23, 2021Updated 4 years ago
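- This project is an open-source tutorial on Harness Engineering. It aims to help developers understand and master how to build a robust underlying runtime architecture for complex, long-running AI agents in the era of large models.☆155Apr 25, 2026Updated last month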
- [CVPR 2024] Boosting Adversarial Transferability by Block Shuffle and Rotation☆14Feb 28, 2024Updated 2 years ago
- ☆13Dec 25, 2024Updated last year
- Robust Adversarial Objects against Deep Learning Models☆11Mar 28, 2020Updated 6 years ago
- ☆13Feb 1, 2024Updated 2 years ago
- Code and data for PAN and PAN-phys.☆14Mar 20, 2023Updated 3 years ago
- ☆10Dec 10, 2023Updated 2 years ago