从零到一实现一个 miniLLM~(动手学习LLM)
☆78Apr 30, 2024Updated last year
Alternatives and similar repositories for LLMs-101
Users that are interested in LLMs-101 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)☆541Mar 23, 2025Updated last year
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆496May 1, 2025Updated 10 months ago
- Chinese license plate recognition☆28Nov 13, 2021Updated 4 years ago
- [EMNLP 2023 (Findings)] Schema-adaptable Knowledge Graph Construction☆22Jan 28, 2024Updated 2 years ago
- ☆11Nov 18, 2024Updated last year
- 雅思词汇真经、雅思语法、听力 179、阅读 538 同义替换等。Everything during preparing for my IELTS exam.☆17Feb 21, 2024Updated 2 years ago
- 数字人+大模型☆26Nov 7, 2023Updated 2 years ago
- 自动检查脚本&雅思听力真题语料库 机考笔试第二版☆12Jan 11, 2024Updated 2 years ago
- 用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.☆2,901May 21, 2024Updated last year
- RecBase: Generative Foundation Model Pretraining for Zero-Shot Recommendation☆43Dec 9, 2025Updated 3 months ago
- 基于电商导购机器人,自然语言理解(NLU) ,文本纠错,歧义词消歧☆12May 5, 2020Updated 5 years ago
- code for "Deep Learning for Sequential Recommendation: Algorithms, Influential Factors, and Evaluations"☆12Sep 7, 2020Updated 5 years ago
- PinData is a modern, open-source dataset management platform designed specifically for large language model (LLM) training workflows☆44Jul 7, 2025Updated 8 months ago
- ANN-based Expectations Algorithm applied to the Neoclassical Investment Model☆10Mar 15, 2023Updated 3 years ago
- Here is the repo for public scripts.☆11Jul 16, 2022Updated 3 years ago
- [EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“☆26Jun 24, 2024Updated last year
- Replication material for "Optimal Automatic Stabilizers"☆11Aug 9, 2021Updated 4 years ago
- 从0开始,将chatgpt的技术路线跑一遍。☆276Sep 5, 2024Updated last year
- 从零实现一个小参数量中文大语言模型。☆979Aug 22, 2024Updated last year
- Vary-tiny codebase upon LAVIS (for training from scratch)and a PDF image-text pairs data (about 600k including English/Chinese)☆86Sep 21, 2024Updated last year
- nanoGPT using Equinox☆15Mar 3, 2023Updated 3 years ago
- ☆20Jan 17, 2026Updated 2 months ago
- Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths☆17Jul 10, 2025Updated 8 months ago
- ☆12Aug 6, 2024Updated last year
- ☆11Dec 8, 2025Updated 3 months ago
- ☆13Jan 10, 2023Updated 3 years ago
- VGrow for deep generative learning☆12Feb 1, 2019Updated 7 years ago
- ☆10Jan 25, 2018Updated 8 years ago
- Replication fles for numerical solution in "Monetary Policy, Redistribution, and Risk Premia"☆13Jan 23, 2024Updated 2 years ago
- 中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。☆1,682Apr 20, 2024Updated last year
- ☆14Jul 25, 2019Updated 6 years ago
- 关于UniswapV2,所有你需要了解的一切,这里估计一定都有!文档+视频(合约解析部署、前端部署、subgraph解析部署)☆10Dec 9, 2022Updated 3 years ago
- LLM Agents: Landing Page Generation for an E-commerce Platform Using CrewAI, Groq-LangChain and Qdrant☆15May 30, 2024Updated last year
- Solution to Macroeconomic Models using Python☆12Oct 1, 2024Updated last year
- Curating Cognitive Behavioral Therapy☆13Dec 21, 2023Updated 2 years ago
- a autodl environment for native finetune stable diffusion.☆11Dec 7, 2022Updated 3 years ago
- Data and baseline code of EMNLP 2021 paper "MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset".☆31Nov 5, 2021Updated 4 years ago
- 用Numpy复现可训练的LLaMa3☆34Jul 5, 2024Updated last year
- AFFNet-Unofficial Implementation☆15Aug 23, 2023Updated 2 years ago