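🎓 A systematic course on building large language models | 🛠️ Covers pretraining data engineering, tokenizers, Transformer, MoE, GPU programming (CUDA/Triton), distributed training, scaling laws, inference optimization, and alignment (SFT/RLHF/GRPO) | 🚀 Six progressive, code-driven assignments that build a full-stack understanding of LLMs
☆479 Apr 12, 2026 Updated last week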
Alternatives and similar repositories for diy-llm
Users that are interested in diy-llm are comparing it to the libraries listed below.
- Music Language Model Generation, Optimization, and Practice ☆51 Apr 10, 2026 Updated last week
- A full-stack algorithm tutorial from NLP to LLMs; read online at https://datawhalechina.github.io/base-llm/ ☆652 Apr 2, 2026 Updated 2 weeks ago
- Official implementation of the paper "Embed Progressive Implicit Preference in Unified Space for Deep Collaborative Filtering" ☆20 Jun 22, 2025 Updated 9 months ago
- An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models ☆15 Feb 27, 2025 Updated last year
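- A full-stack reference guide to large language models that uses the most concise code to help you define, end to end, every detail of a model, from training from scratch to engineering deployment ☆162 Jan 15, 2026 Updated 3 months ago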
- CS341 for Spring 2024 ☆11 Jul 15, 2024 Updated last year
- Sequential-Quadratic-Programming Derivative-Free Optimization ☆17 Dec 26, 2022 Updated 3 years ago
- [AAAI 2025] Official code for paper: DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image S… ☆18 Jun 16, 2025 Updated 10 months ago
- ☆15 Jan 16, 2024 Updated 2 years ago
- Code space of the paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025) ☆24 Apr 26, 2025 Updated 11 months ago
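- A Chinese version of Stanford Town, deployed with local models, with Chinese-localized prompt engineering and a simplified workflow ☆52 Oct 16, 2025 Updated 6 months ago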
- A framework for evolving and testing question-answering datasets with various models. ☆23 Feb 28, 2024 Updated 2 years ago
- PyTorch Implementation for Neural Graph Collaborative Filtering ☆32 Jul 6, 2023 Updated 2 years ago
- Unified Audio-Visual Perception for Multi-Task Video Localization ☆31 Apr 19, 2024 Updated 2 years ago
- [NeurIPS 2024] Code, Dataset, Samples for the VATT paper "Tell What You Hear From What You See - Video to Audio Generation Through Text" ☆36 Jul 24, 2025 Updated 8 months ago
- Deep Generative Models course, 2025 ☆10 Jun 5, 2025 Updated 10 months ago
- KDD 2024 AQA competition 2nd place solution ☆12 Jul 21, 2024 Updated last year
- ☆40 Feb 14, 2026 Updated 2 months ago
- [2025 CVPR] Towards Open-Vocabulary Audio-Visual Event Localization