EthanLiu6 / LLM_knowledgeLinks

- 【LLM面经】大模型实习面试指南。手撕代码、面经经验、思考题等。初学者学习ing......欢迎指正错误

☆21

Alternatives and similar repositories for LLM_knowledge

Users that are interested in LLM_knowledge are comparing it to the libraries listed below

Sorting:

WalkerMitty / Fast-Llama2
Fast instruction tuning with Llama2
☆11Updated last year
bbruceyuan / bit-brain
最少使用 3090 即可训练自己的比特大脑（miniLLM）🧠（进行中）. Train your own BitBrain(A mini LLM) with just an RTX 3090 minimum.
☆39Updated 5 months ago
liucongg / LLMsBook
大型语言模型实战指南：应用实践与场景落地
☆83Updated last year
shuyhere / all-about-llm
大语言模型训练和服务调研
☆36Updated 2 years ago
NonlinearWorld001 / LLMs-learning
小模型LLM的搭建，学习LLM的建模、训练过程基于DeepSeek-MOE架构的小模型，用于个人学习，从0开始，解释每一条语句
☆11Updated 8 months ago
yongzhuo / qwen2-sft
Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理
☆69Updated last year
yanqiangmiffy / KDD2024-WhoIsWho-Top3
KDD2024-WhoIsWho-Top3
☆16Updated last year
5663015 / LLMs_train
一套代码指令微调大模型
☆38Updated 2 years ago
airaria / GRAIN
GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models
☆19Updated 2 years ago
mobvoi / seq-monkey-data
☆170Updated last year
shengtaovvv / Dialogue
本项目由三个模块构成。意图识别：判断用户的意图是业务型还是闲聊型；模型检索：该部分构建一个语料库，当用户发起新的query（通过意图识别判断为业务型对话）时，为用户匹配query检索的最佳response，使用HSWN进行召回（粗排），然后构建句子的相似度，并利用Lig…
☆12Updated 4 years ago
taishan1994 / pytorch-distributed-NLP
pytorch分布式训练
☆73Updated 2 years ago
enze5088 / ChineseModernBert
中文预训练ModernBert
☆95Updated 8 months ago
Lacusking / linux_clash
为centos服务器配置clash服务
☆13Updated last year
heyblackC / BetterMixture-Top1-Solution
天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案
☆32Updated last year
zysNLP / quickllm
A repo for update and debug Mixtral-7x8B、MOE、ChatGLM3、LLaMa2、 BaChuan、Qwen an other LLM models include new models mixtral, mixtral 8x7b, …
☆47Updated 2 months ago
toby0077 / breast-Cancer-sklearn
breast Cancer乳腺癌数据挖掘，python sklearn
☆11Updated 6 years ago
yxuansu / Chinese-TaCL-BERT-NER-CWS
基于中文TaCL-BERT的中文命名实体识别及中文分词
☆32Updated 4 years ago
yanqiangmiffy / Agent-Tutorials-ZH
大模型智能体Agent中文教程，博客代码仓库
☆54Updated last month
km1994 / nlp_paper_study_text_match
仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记【文本匹配篇】
☆13Updated 3 years ago
MetaGLM / OpenLM
本项目致力于为大模型领域的初学者提供全面的知识体系，包括基础和高阶内容，以便开发者能迅速掌握大模型技术栈并全面了解相关知识。
☆62Updated 11 months ago
ZBayes / poc_project
通用简单工具项目
☆22Updated last year
datawhalechina / hands-dirty-nlp
本课程面对具有一定机器学习基础，但尚未入门的NLPer或经验尚浅的NLPer，尽力避免陷入繁琐枯燥的公式讲解中，力求用代码展示每个模型背后的设计思想，同时也会带大家梳理每个模块下的技术演变，做到既知树木也知森林。
☆89Updated 2 years ago
ECNUdase / Seminar-PRML
《Pattern Recognition and Machine Learning》阅读讨论班
☆35Updated 6 years ago
AI-Study-Han / Mini-Llama2-Chinese
想要从零开始训练一个中文的mini大语言模型，可以进行基本的对话，模型大小根据手头的机器决定
☆65Updated last year
dasiki / Dialog-System-with-Task-Retrieval-and-Seq2seq
京东/淘宝客服对话数据公开，seq2seq生成模型设计对话系统获第二名
☆44Updated 3 years ago
datawhalechina / whale-paper
Datawhale论文分享，阅读前沿论文，分享技术创新
☆51Updated last year
Ginjing-Yuan / QWen2-from_ground_up
☆21Updated last year
taishan1994 / python_common_code_collection
收集经常用到的一些python代码
☆51Updated 6 months ago
cwxndl / LLM
大语言模型应用：RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛
☆74Updated 10 months ago