📖 从零基础到面试通关 —— 22节课彻底搞懂大语言模型 | Learn MiniMind: 系统化学习LLM训练全流程
☆199Apr 1, 2026Updated 3 weeks ago
Alternatives and similar repositories for learn-minimind
Users that are interested in learn-minimind are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AI Agent 面试全攻略:从零到Offer,包含200+面试题、企业级项目(Python/Java/Go)、简历模板、STAR面试稿、哆啦A梦漫画图解☆642Apr 1, 2026Updated 3 weeks ago
- This Project is a music recommender based on the spotify's music database☆13Sep 4, 2022Updated 3 years ago
- ☆15Apr 11, 2024Updated 2 years ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆25Jul 21, 2024Updated last year
- Learning and buiding API using Fast API☆16Aug 7, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆23Dec 16, 2022Updated 3 years ago
- The code based on vLLM for the paper “ Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Sep 19, 2024Updated last year
- linux 内核技术文档☆16Feb 26, 2026Updated 2 months ago
- ☆21Jun 9, 2025Updated 10 months ago
- Code for the ACL 2022 (Long paper): "New Intent Discovery with Pre-training and Contrastive Learning".☆14Jul 18, 2022Updated 3 years ago
- ☆17Apr 15, 2025Updated last year
- This module collects per-page stats and decide for each page if it should be migrated, replicated or interleaved.☆17Sep 29, 2015Updated 10 years ago
- 2023年iThome鐵人賽「AI & Data」組佳作【30天內成為NLP大師:掌握關鍵工具和技巧】完整程式碼,該文章會從零開始教你該如何微調大型語言模型☆18Nov 21, 2024Updated last year
- ChatGPT-related papers☆15Mar 31, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆18Dec 20, 2023Updated 2 years ago
- ☆26Sep 16, 2025Updated 7 months ago
- [CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval…☆32Apr 16, 2025Updated last year
- Explore Inter-layer Expert Affinity in MoE Model Inference☆16May 6, 2024Updated last year
- ⛏️This is the storage of my Slides、Reports and Papers. | 存储PPT、报告和论文☆12Oct 27, 2024Updated last year
- A Compute Express Link (CXL) Benchmark Suite☆21Feb 12, 2025Updated last year
- Code of the Grounded MUIE model, REAMO☆10Dec 3, 2024Updated last year
- ☆12Apr 25, 2024Updated 2 years ago
- This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction"☆10Feb 16, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ACL 2026 Main] Analytical FFN-to-MoE Restructuring via Activation Pattern Analysis☆38Updated this week
- [NeurIPS 2024] Implementation of paper - D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models☆23Apr 9, 2025Updated last year
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆24Nov 21, 2024Updated last year
- Sources for the Multi-Clock system as described in the paper: MULTI-CLOCK: Dynamic Tiering for Hybrid Memory Systems, HPCA 2022.☆20Mar 21, 2022Updated 4 years ago
- Make one prompt become an immersive, production‑ready experience: a single pipeline for Text → Image → Music → Lights → Video, with real …☆70Sep 5, 2025Updated 7 months ago
- hot page accounting and migration☆26Apr 23, 2019Updated 7 years ago
- ☆26Mar 31, 2022Updated 4 years ago
- ☆58Nov 28, 2025Updated 5 months ago
- Building BERT Model with PyTorch☆23Dec 9, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- URS Benchmark: Evaluating LLMs on User Reported Scenarios☆31May 30, 2025Updated 10 months ago
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- ☆29Dec 15, 2021Updated 4 years ago
- Artifacts of EuroSys'24 paper "Exploring Performance and Cost Optimization with ASIC-Based CXL Memory"☆31Feb 21, 2024Updated 2 years ago
- softmax, crf, biaffine, globalpointer, efficient globalpointer, ricon☆17Oct 11, 2022Updated 3 years ago
- ☆38Mar 17, 2025Updated last year
- A desktop app that lets you read books sneakily at work. 摸鱼看小说工具。☆14Mar 19, 2025Updated last year