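🎓 A highly detailed walkthrough of the Minimind project for training a large language model from scratch, including but not limited to the architecture and algorithms used, plus LLM interview experience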
☆703 · Apr 17, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for from-minimind-to-more
Users that are interested in from-minimind-to-more are comparing it to the libraries listed below.
- Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver… ☆10 · Jul 21, 2023 · Updated 2 years ago
- Official Repo for the NeurIPS2024 spotlight paper "Are Graph Neural Networks Optimal Approximation Algorithms?" ☆16 · Apr 16, 2026 · Updated 2 weeks ago
- langchain-study ☆27 · Apr 11, 2026 · Updated 2 weeks ago
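- Notes for the ACwing Algorithm Fundamentals course ☆11 · May 15, 2023 · Updated 2 years ago
- Built around two main tracks, a lightweight LLM and Qwen2.5-1.5B, this project completes an end-to-end loop from data processing, model training, and parameter-efficient fine-tuning to evaluation, validation, and service deployment. ☆104 · Apr 21, 2026 · Updated last week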
- 归宿 version 2.0, a JavaEE and Software Engineering course project ☆11 · Mar 11, 2022 · Updated 4 years ago
- Notes on Hung-yi Lee's Machine Learning 2021 course ☆14 · Nov 27, 2022 · Updated 3 years ago
- Official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation ☆13 · Apr 15, 2024 · Updated 2 years ago
- 🚀 A comprehensive LangGraph tutorial collection with hands-on Jupyter notebooks. A comprehensive LangGraph tutorial project, presented as Jupyter Notebooks, covering LangG… ☆36 · Sep 2, 2025 · Updated 7 months ago
- ☆16 · Mar 5, 2023 · Updated 3 years ago
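- BUPT (Beijing University of Posts and Telecommunications), Software Engineering: charging station management system ☆17 · Mar 15, 2024 · Updated 2 years ago
- Project for finding available charging stations on Zhejiang University campuses ☆16 · Oct 8, 2024 · Updated last year
- Code repository for a software engineering course project; the project plans to provide information collection and related online services for "Chinese cultural relics held overseas" ☆16 · May 24, 2023 · Updated 2 years ago
- Tongji University CS summer assignment for the "Principles of Computer Organization" course project ☆13 · Jul 25, 2021 · Updated 4 years ago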
- AAAI 2022 (Official implementation of "pan-sharpening with customized transformer and invertible neural network") ☆14 · Jul 31, 2022 · Updated 3 years ago
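- Improving UNet by using ResNet as the encoder ☆13 · Jun 14, 2020 · Updated 5 years ago
- VCampus virtual campus / Southeast University "Software Practice" course project ☆13 · Jul 5, 2021 · Updated 4 years ago
- I am an undergraduate in the 2021 cohort at Tongji University, majoring in Data Science and Big Data Technology. In spring 2023 I spent an exchange semester in the Department of Computer Science and Information Engineering, College of Electrical Engineering and Computer Science, National Taiwan University; these are my assignments from the OS course. ☆11 · Jun 15, 2023 · Updated 2 years ago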
- An LLM training framework built from the ground up, featuring a custom BumbleBee architecture and end-to-end support for multiple open-so… ☆67 · Feb 9, 2026 · Updated 2 months ago
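- A medical question-answering system based on Qwen2 + SFT + DPO. The project uses custom SFTTrainer/DPOTrainer/TRPOTrainer classes for training, and also calls various knowledge-base tools (neo4j, milvus, LDA, etc.) to generate training data automatically. In addition, vllm is used for inference… ☆76 · Apr 22, 2026 · Updated last week
- A reader-writer lock implemented in C#, covering basic read/write synchronization needs ☆14 · May 2, 2019 · Updated 6 years ago
- An online shopping subsystem implemented in JavaEE; database course project ☆19 · Sep 24, 2022 · Updated 3 years ago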
- AI coding assistant skill for creating visually rich PowerPoint (.pptx) presentations with native OMML math, LaTeX formulas, and Graphviz… ☆69 · Mar 17, 2026 · Updated last month
- ☆25 · Mar 4, 2024 · Updated 2 years ago
- A Novel Approach for Protein Sequence Design Based on Secondary Structure ☆24 · Mar 7, 2025 · Updated last year
- An environment based on JSBSIM aimed at one-to-one close air combat. ☆19 · Sep 14, 2025 · Updated 7 months ago
- Crowdsourced errand-running project for a software engineering course ☆22 · Oct 12, 2022 · Updated 3 years ago
- Open-source DSR research workflow template that works with Claude or other AI agents. ☆72 · Sep 27, 2025 · Updated 7 months ago
- ☆35 · Jul 25, 2024 · Updated last year
- Benchmark tests supporting the TiledCUDA library. ☆18 · Nov 19, 2024 · Updated last year
- ☆13 · Mar 25, 2021 · Updated 5 years ago
- WIoU implementation for YOLOv8 ☆20 · Nov 22, 2023 · Updated 2 years ago
- Code for operating systems course lab assignments at Jiangxi Normal University ☆13 · May 9, 2025 · Updated 11 months ago
- A simple neural network built with the C++17 standard and the Eigen library, supporting both forward and backward propagation. ☆11 · Jul 27, 2024 · Updated last year
- A simple and efficient memory pool implemented in C++11. ☆10 · Jun 2, 2022 · Updated 3 years ago
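- Shenzhen Technology University data structures OJ answers (2021) ☆22 · Mar 7, 2022 · Updated 4 years ago
- Mind maps for "Principles of Computer Organization" (Ren Guolin edition), Southeast University ☆16 · Jan 23, 2021 · Updated 5 years ago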
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation” ☆15 · Jan 3, 2025 · Updated last year
- Tongji University computer networks lab reports ☆14 · Jan 1, 2022 · Updated 4 years ago