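🎓 A highly detailed walkthrough of the MiniMind project for training a large model from scratch, covering (among other things) the architecture and algorithms used, as well as LLM interview experience
☆458 · Mar 22, 2026 · Updated last week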
Alternatives and similar repositories for from-minimind-to-more
Users who are interested in from-minimind-to-more are comparing it to the repositories listed below.
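- Official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation ☆13 · Apr 15, 2024 · Updated last year
- Edge-oriented Point cloud Transformer for 3D Intracranial Aneurysm Segmentation. MICCAI22 ☆13 · Aug 18, 2022 · Updated 3 years ago
- End-to-end development pipeline for a medical LLM based on Qwen3: 0. tokenizer training, 1. continued pretraining, 2. fine-tuning, 3. reinforcement learning, 4. quantization, 5. distillation, 6. evaluation, 7. LoRA model merging, 8. serving, 9. deployment ☆37 · Jan 3, 2026 · Updated 2 months ago
- BUPT Software Engineering course: charging station management system ☆17 · Mar 15, 2024 · Updated 2 years ago
- Parking space management system based on Spring Boot. Front end: Thymeleaf, jQuery, Bootstrap; back end: Spring Boot, MyBatis. There are two roles, user and administrator: administrators enter parking space information in the admin console, and users search for and reserve spaces online, taking the hassle out of finding parking ☆25 · Jan 26, 2025 · Updated last year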
- Dalian University of Technology Compiler Principles course project ☆11 · Jan 1, 2024 · Updated 2 years ago
- ZJU CS lab reports: Digital Logic Design, Computer Organization, Computer Architecture ☆27 · Jan 17, 2024 · Updated 2 years ago
- Locality-Aware Hyperspectral Classification ☆18 · Jan 10, 2025 · Updated last year
- A minimalist, efficient mind map ☆38 · Feb 8, 2026 · Updated last month
- VCampus virtual campus / Southeast University "Software Practice" project ☆13 · Jul 5, 2021 · Updated 4 years ago
- ☆19 · Dec 22, 2024 · Updated last year
- An LLM training framework built from the ground up, featuring a custom BumbleBee architecture and end-to-end support for multiple open-so… ☆63 · Feb 9, 2026 · Updated last month
- This is the repository for the CONFLARE (CONformal LArge language model REtrieval) Python package. ☆22 · Apr 19, 2024 · Updated last year
- My solutions to problems in MIT 18.06 Linear Algebra. ☆14 · Jul 18, 2024 · Updated last year
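- Source code for the "Comprehensive Database Systems Lab" course project at Wuhan University of Technology. The chosen topic is "Design and Implementation of a Teaching Management Information System". ☆17 · Apr 20, 2021 · Updated 4 years ago
- Harbin Engineering University Compiler Principles course project ☆15 · Dec 17, 2019 · Updated 6 years ago
- AIGC source code supporting ChatGPT, Zhipu AI, iFlytek Spark, InternLM, Kimi.ai, MoonshotAI, Doubao AI, and other large models. Described as the easiest-to-deploy, fastest-responding AIGC environment available. The PHP version calls the various model APIs for Q&A and chat, communicating in streaming mode so that output appears as it is generated. The front end uses Even… ☆17 · Nov 10, 2024 · Updated last year
- Database and data structures course project implemented in C++: a library management system ☆45 · Feb 19, 2019 · Updated 7 years ago
- Persistent dense gemm for Hopper in `CuTeDSL` ☆15 · Aug 9, 2025 · Updated 7 months ago
- A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and are cheaper to train, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials. ☆31 · Sep 19, 2024 · Updated last year
- Benchmark tests supporting the TiledCUDA library. ☆18 · Nov 19, 2024 · Updated last year
- ☆14 · Feb 9, 2025 · Updated last year
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research ☆16 · Dec 8, 2024 · Updated last year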
- Lab assignment code for the Operating Systems course at Jiangxi Normal University ☆13 · May 9, 2025 · Updated 10 months ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models" ☆24 · May 27, 2025 · Updated 10 months ago
- Wuhan University of Technology postgraduate recommendation (baoyan) data analysis ☆17 · May 19, 2023 · Updated 2 years ago
- Database kernel notes ☆13 · Aug 18, 2022 · Updated 3 years ago
- Full Marks | Auditing CS61B Data Structures, Spring 2021 ☆13 · Jul 31, 2023 · Updated 2 years ago
- Qidian chapter comments (本章说) ☆23 · Jan 29, 2024 · Updated 2 years ago
- My assignment solutions for Michigan's EECS 498-008/598-008 (Deep Learning for Computer Vision) by Prof. Justin Johnson, version 2022 ☆20 · Mar 29, 2022 · Updated 4 years ago
- GEMV implementation with CUTLASS ☆19 · Aug 21, 2025 · Updated 7 months ago
- Code implementation for "Large AI Model Empowered Multimodal Semantic communication" ☆24 · Jul 4, 2024 · Updated last year
- [AAAI'25] SPRING: Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models ☆26 · Sep 24, 2025 · Updated 6 months ago
- ☆18 · Apr 19, 2021 · Updated 4 years ago
- Adding random noise to a text dataset, and controlling very accurately the quality of the result ☆20 · Updated this week
- ☆11 · May 2, 2023 · Updated 2 years ago
- Huawei Cup Postgraduate Mathematical Contest in Modeling: data-analysis code from past years (updated irregularly; previously won a first prize) ☆132 · Jan 31, 2026 · Updated last month
- ☆13 · Jan 16, 2025 · Updated last year
- HyperMamba: A Spectral-Spatial Adaptive Mamba for Hyperspectral Image Classification ☆35 · Nov 8, 2024 · Updated last year