📖 从零基础到面试通关 —— 22节课彻底搞懂大语言模型 | Learn MiniMind: 系统化学习LLM训练全流程
☆272Apr 1, 2026Updated last month
Alternatives and similar repositories for learn-minimind
Users that are interested in learn-minimind are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Jun 10, 2023Updated 2 years ago
- AI Agent 面试全攻略:从零到Offer,包含200+面试题、企业级项目(Python/Java/Go)、简历模板、STAR面试稿、哆啦A梦漫画图解☆905Apr 1, 2026Updated last month
- ☆20Feb 20, 2025Updated last year
- 基于2016年电工杯数学建模竞赛数据集建立的超短期以及短期负荷预测☆18May 4, 2024Updated 2 years ago
- ☆15Apr 11, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Nov 5, 2024Updated last year
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆25Jul 21, 2024Updated last year
- [AAAI 2025] ConVis: Contrastive Decoding with Hallucination Visualization for Mitigating Hallucinations in Multimodal Large Language Mode…☆26Sep 26, 2024Updated last year
- ☆23Dec 16, 2022Updated 3 years ago
- The code based on vLLM for the paper “ Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Sep 19, 2024Updated last year
- Tiny ImageNet Classification Exercise with PyTorch☆16Aug 21, 2021Updated 4 years ago
- Official implementation of "Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning", ACL2022 main con…☆14Jul 23, 2022Updated 3 years ago
- linux 内核技术文档☆16Apr 27, 2026Updated 3 weeks ago
- DBPM is a simple algorithm designed as a lightweight plug-in without learnable parameters to enhance the performance of time series contr…☆16Mar 8, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 一个可运行的 `Skill-first + Vector-augmented + LangGraph` RAG 系统,支持多模型厂商、分层记忆和 Web 聊天界面。☆85Mar 24, 2026Updated last month
- ☆21Jun 9, 2025Updated 11 months ago
- Code for the ACL 2022 (Long paper): "New Intent Discovery with Pre-training and Contrastive Learning".☆14Jul 18, 2022Updated 3 years ago
- The pmem.io Website☆17Jan 20, 2026Updated 4 months ago
- This module collects per-page stats and decide for each page if it should be migrated, replicated or interleaved.☆17Sep 29, 2015Updated 10 years ago
- 2023年iThome鐵人賽「AI & Data」組佳作【30天內成為NLP大師:掌握關鍵工具和技巧】完整程式碼,該文章會從零開始教你該如何微調大型語言模型☆18Nov 21, 2024Updated last year
- ChatGPT-related papers☆15May 6, 2026Updated last week
- Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification☆17Jan 8, 2024Updated 2 years ago
- ☆18Dec 20, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆26Sep 16, 2025Updated 8 months ago
- Not just slides - delivery-ready talks☆62Apr 16, 2026Updated last month
- Time Series Contrastive Learning with Information-Aware Augmentations (Code)☆23Mar 21, 2023Updated 3 years ago
- [CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval…☆32Apr 16, 2025Updated last year
- ☆14Jul 12, 2023Updated 2 years ago
- Explore Inter-layer Expert Affinity in MoE Model Inference☆16May 6, 2024Updated 2 years ago
- ⛏️This is the storage of my Slides、Reports and Papers. | 存储PPT、报告和论文☆12Oct 27, 2024Updated last year
- Code of the Grounded MUIE model, REAMO☆10Dec 3, 2024Updated last year
- ☆12Apr 25, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction"☆10Feb 16, 2024Updated 2 years ago
- 一个模仿Kafka的简单消息中间件☆14Jun 29, 2022Updated 3 years ago
- [NeurIPS 2024] Implementation of paper - D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models☆23Apr 9, 2025Updated last year
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆24Nov 21, 2024Updated last year
- ☆52Nov 27, 2025Updated 5 months ago
- Sources for the Multi-Clock system as described in the paper: MULTI-CLOCK: Dynamic Tiering for Hybrid Memory Systems, HPCA 2022.☆20Mar 21, 2022Updated 4 years ago
- Make one prompt become an immersive, production‑ready experience: a single pipeline for Text → Image → Music → Lights → Video, with real …☆70Sep 5, 2025Updated 8 months ago