ShaohonChen / transformers_from_scratchLinks
pretrain a wiki llm using transformers
☆48Updated 10 months ago
Alternatives and similar repositories for transformers_from_scratch
Users that are interested in transformers_from_scratch are comparing it to the libraries listed below
Sorting:
- 通义千问VLLM推理部署DEMO☆586Updated last year
- TinyRAG☆314Updated 2 weeks ago
- 从0开始,将chatgpt的技术路线跑一遍。☆243Updated 10 months ago
- 通义千问的DPO训练☆50Updated 9 months ago
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆64Updated 5 months ago
- Alpaca Chinese Dataset -- 中文指令微调数据集☆209Updated 9 months ago
- 一些 LLM 方面的从零复现笔记☆209Updated 2 months ago
- Retriever-0.1B☆93Updated last year
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆56Updated 11 months ago
- 快速入门RAG与私有化部署☆193Updated last year
- qwen ai agent☆135Updated last year
- 大模型/LLM推理和部署理论与实践☆293Updated this week
- LLM101n: Let's build a Storyteller 中文版☆131Updated 11 months ago
- 探索 LLM 在法律行业的应用潜力☆90Updated 7 months ago
- 从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)☆451Updated 3 months ago
- 基于ReAct手搓一个Agent Demo☆141Updated 2 weeks ago
- DeepSeek 系列工作解读、扩展和复现。☆662Updated 3 months ago
- 尝试自己从头写一个LLM,参考llama和nanogpt☆62Updated last year
- ☆83Updated 5 months ago
- FinQwen: 致力于构建一个开放、稳定、高质量的金融大模型项目,基于大模型搭建金融场景智能问答系统,利用开源开放来促进「AI+金融」。☆397Updated last year
- simple decoder-only GTP model in pytorch☆41Updated last year
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆452Updated 2 months ago
- 本项目为书籍《大模型RAG实战》的代码以及资料汇总。☆238Updated 8 months ago
- 大型语言模型实战指南:应用实践与场景落地☆74Updated 10 months ago
- Qwen3 Fine-tuning: Medical R1 Style Chat☆106Updated last month
- Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.☆555Updated last year
- 一些大语言模型和多模态模型的生态,主要包括跨模态搜索、投机解码、QAT量化、多模态量化、ChatBot、OCR☆184Updated 3 weeks ago
- 本项目是针对RAG中的Retrieve阶段的召回技术及算法效果所做评估实验。使用主体框架为LlamaIndex.☆264Updated 6 months ago
- Awesome Chinese LLM: A curated list of Chinese Large Language Model 中文大语言模型数据集和模型资料汇总☆159Updated last year
- 大模型技术栈一览☆108Updated 9 months ago