📖 从零基础到面试通关 —— 22节课彻底搞懂大语言模型 | Learn MiniMind: 系统化学习LLM训练全流程
☆66Apr 1, 2026Updated last week
Alternatives and similar repositories for learn-minimind
Users that are interested in learn-minimind are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Apr 11, 2024Updated last year
- 本项目是一个基于 LangChain/LangGraph 的 ReAct Agent 系统,专为“智扫通”扫地机器人提供智能客服功能,集成了 RAG 知识库问答、动态提示词切换(普通模式 vs 报告生成模式)、Streamlit 流式聊天界面、Chroma 向量存储以及外部…☆60Mar 25, 2026Updated 2 weeks ago
- ☆63Mar 26, 2026Updated 2 weeks ago
- ☆20Feb 8, 2024Updated 2 years ago
- 一个可运行的 `Skill-first + Vector-augmented + LangGraph` RAG 系统,支持多模型厂商、分层记忆和 Web 聊天界面。☆73Mar 24, 2026Updated 2 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Learning and buiding API using Fast API☆16Aug 7, 2021Updated 4 years ago
- ☆17Sep 9, 2025Updated 7 months ago
- Langchain tool for time series forecasting☆20Jun 5, 2023Updated 2 years ago
- The code based on vLLM for the paper “ Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Sep 19, 2024Updated last year
- Tiny ImageNet Classification Exercise with PyTorch☆16Aug 21, 2021Updated 4 years ago
- ☆19Mar 21, 2024Updated 2 years ago
- Official implementation of "Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning", ACL2022 main con…☆14Jul 23, 2022Updated 3 years ago
- DBPM is a simple algorithm designed as a lightweight plug-in without learnable parameters to enhance the performance of time series contr…☆17Mar 8, 2024Updated 2 years ago
- ☆20Jun 9, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆28Aug 22, 2025Updated 7 months ago
- Official Implementation of "A Hybrid Architecture for Out of Domain Intent Detection and Intent Discovery"☆11May 31, 2023Updated 2 years ago
- ☆25Sep 16, 2025Updated 6 months ago
- Not just slides - delivery-ready talks☆58Mar 11, 2026Updated 3 weeks ago
- 2023年iThome鐵人賽「AI & Data」組佳作【30天內成為NLP大師:掌握關鍵工具和技巧】完整程式碼,該文章會從零開始教你該如何微調大型語言模型☆18Nov 21, 2024Updated last year
- ChatGPT-related papers☆15Mar 31, 2026Updated last week
- Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification☆16Jan 8, 2024Updated 2 years ago
- ☆18Dec 20, 2023Updated 2 years ago
- Time Series Contrastive Learning with Information-Aware Augmentations (Code)☆24Mar 21, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 使用biaffine的中文命名实体识别☆10Jan 12, 2023Updated 3 years ago
- Notes about courses Machine Learning 2025 Spring by Hung-yi Lee☆27Sep 22, 2025Updated 6 months ago
- ☆20Sep 24, 2025Updated 6 months ago
- Explore Inter-layer Expert Affinity in MoE Model Inference☆16May 6, 2024Updated last year
- ⛏️This is the storage of my Slides、Reports and Papers. | 存储PPT、报告和论文☆12Oct 27, 2024Updated last year
- 《动手学机器学习》习题解答☆90Jan 15, 2026Updated 2 months ago
- Code of the Grounded MUIE model, REAMO☆11Dec 3, 2024Updated last year
- ☆12Apr 25, 2024Updated last year
- ☆38Mar 2, 2026Updated last month
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction"☆10Feb 16, 2024Updated 2 years ago
- Source separation of underwater acoustic radiated noise signals from ships with unknown numbers of signals. Using keras 2.2.4 with tenso…☆21Dec 16, 2025Updated 3 months ago
- [NeurIPS 2024] Implementation of paper - D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models☆23Apr 9, 2025Updated last year
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆24Nov 21, 2024Updated last year
- 基于python3训练中文wiki词向量、字向量、拼音向量☆11Jan 2, 2022Updated 4 years ago
- Sources for the Multi-Clock system as described in the paper: MULTI-CLOCK: Dynamic Tiering for Hybrid Memory Systems, HPCA 2022.☆19Mar 21, 2022Updated 4 years ago
- ☆26Mar 31, 2022Updated 4 years ago