ShaohonChen / transformers_from_scratch
pretrain a wiki llm using transformers
☆20Updated 4 months ago
Alternatives and similar repositories for transformers_from_scratch:
Users that are interested in transformers_from_scratch are comparing it to the libraries listed below
- 🎓Automatically Update agent Papers Daily using Github Actions (Update Every 12th hours)☆24Updated this week
- ☆24Updated last month
- 大模型检索增强生成技术最佳实践。☆54Updated 4 months ago
- ☆21Updated last year
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆52Updated 2 months ago
- Zero-human, cold-start construction of long-chain agents in professional domains☆20Updated this week
- LLaMA Factory Document☆88Updated last month
- 顾名思义:手搓的RAG☆116Updated 10 months ago
- 通义千问的DPO训练☆30Updated 3 months ago
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆52Updated last month
- SUS-Chat: Instruction tuning done right☆48Updated last year
- 中文原生检索增强生成测评基准☆105Updated 9 months ago
- 使用单个24G显卡,从0开始训练LLM☆50Updated 2 months ago
- 本项目用于大模型数学解题能力方面的数据集合成,模型训练及评测,相关文章记录。☆68Updated 4 months ago
- Generate dialog data from documents using LLM like ChatGLM2 or ChatGPT;利用ChatGLM2,ChatGPT等大模型根据文档生成对话数据集☆146Updated last year
- LLM101n: Let's build a Storyteller 中文版☆121Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆127Updated last month
- 专注于对话系统领域的技术分享,重点写《Dify应用操作和源码剖析》专栏。☆71Updated 6 months ago
- A Multi-modal RAG Project with Dataset from Honor of Kings, one of the most popular smart phone games in China☆57Updated 4 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated 9 months ago
- ☆81Updated 5 months ago
- ☆55Updated 10 months ago
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities.☆33Updated last month
- accelerate generating vector by using onnx model☆13Updated 11 months ago
- 用于AIOPS24挑战赛的Demo☆59Updated 7 months ago
- unify-easy-llm(ULM)旨 在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆53Updated 5 months ago
- 大语言模型训练和服务调研☆35Updated last year
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆180Updated 2 months ago
- 大型语言模型实战指南:应用实践与场景落地☆49Updated 4 months ago
- Imitate OpenAI with Local Models☆85Updated 4 months ago