Notes on pretraining an LLM from scratch, SFT, RLHF, and DPO, plus interview questions
☆16Sep 2, 2024Updated last year
Alternatives and similar repositories for LLM_Learning_ph
Users that are interested in LLM_Learning_ph are comparing it to the libraries listed below
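- A multi-agent system built on LangChain with a Streamlit web interface. It runs web searches based on user input and provides travel-related chat, and it also makes sales recommendations from a local knowledge base to suggest personalized travel products.☆16Apr 20, 2025Updated 10 months ago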
- ☆13Sep 2, 2021Updated 4 years ago
- Sequence Tagging for Biomedical Extractive Question Answering (Bioinformatics'2020)☆11Jul 3, 2023Updated 2 years ago
- ☆12Dec 11, 2021Updated 4 years ago
- Natural Language Processing (NLP) and Large Language Models (LLM) with Fine-Tuning LLM and make Chatbot Question answering (QA) with LoRA…☆13Jan 20, 2024Updated 2 years ago
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆14Dec 17, 2023Updated 2 years ago
- ☆12Feb 21, 2024Updated 2 years ago
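- End-to-end development pipeline for a medical LLM based on qwen3: 0. tokenizer training 1. continued pretraining 2. fine-tuning 3. reinforcement learning 4. quantization 5. distillation 6. evaluation 7. LoRA model merging 8. serving 9. deployment☆27Jan 3, 2026Updated last month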
- Implementation of our paper "Towards Consistent Document-Level Entity Linking: Joint Models for Entity Linking and Coreference Resolution…☆12Nov 13, 2022Updated 3 years ago
- 6th China Software Cup software design competition: an enterprise VAT invoice data analysis system☆15Aug 14, 2017Updated 8 years ago
- The objective of this project is to classify whether upcoming product will have positive or negative Sentiment.☆11May 18, 2019Updated 6 years ago
- Data and code for the paper Causal Reasoning of Entities and Events in Procedural Texts.☆12May 26, 2023Updated 2 years ago
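- A wrapper class for a tf-idf model that computes tf-idf values for all documents and implements a tf-idf-based search engine: given a query, it scores the similarity to each document and returns the top-k most similar documents.☆16Nov 20, 2020Updated 5 years ago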
- 2021 Language and Intelligence Technology Competition: document-level relation extraction☆18Sep 8, 2021Updated 4 years ago
- Automated daily health and temperature check-in for Northeastern University (东北大学)☆18Feb 24, 2023Updated 3 years ago
- Implementation of the paper 'Improve Discourse Dependency Parsing with Contextualized Representations', Findings of NAACL 2022☆14Jul 15, 2022Updated 3 years ago
- ☆16Mar 6, 2020Updated 5 years ago
- ☆18Jan 31, 2023Updated 3 years ago
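- This repository records and shares my learning journey in LLMs and agents, using hands-on projects to understand the underlying techniques. It builds LLM- and agent-based applications from scratch to learn LLM principles and gain agent development experience.☆24Mar 28, 2025Updated 11 months ago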
- Front-end page templates for a personal blog site, written with the syntax of art-template, Tencent's open-source template engine☆16Mar 19, 2019Updated 6 years ago
- ☆23Apr 15, 2025Updated 10 months ago
- This repository is the implementation of "Top-down RST Parsing Utilizing Granularity Levels in Documents" published at AAAI 2020.☆20Dec 14, 2020Updated 5 years ago
- A Vue-based front-end template framework☆18Jan 5, 2023Updated 3 years ago
- A RAG project built on large language models, implementing both text-based and knowledge-graph-based RAG☆27Dec 11, 2025Updated 2 months ago
- Agentic RAG with MCP Server☆35May 15, 2025Updated 9 months ago
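- Keyboard and mouse sound effects for Windows, including piano, rhythm-game mode, mahjong, wooden fish, Yasuo (疾风剑豪), anime, and the "只因你太美" meme sounds☆29Nov 11, 2024Updated last year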
- A task relevant entity linking toolkit☆22Apr 2, 2022Updated 3 years ago
- LaTeX template for Tianjin University Master's and Doctoral Theses.☆35Jan 29, 2026Updated last month
- [Course] Simple database in C++ (Database 2017)☆22Apr 1, 2019Updated 6 years ago
- Resource of School of Software Engineering, South China University of Technology.☆21Feb 13, 2022Updated 4 years ago
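- A learning codebase for reinforcement learning from human feedback (RLHF), built from scratch. It implements PPO, GRPO, GSPO, and related policy optimization algorithms, and provides a clear, reproducible training pipeline. The documentation was converted from LaTeX files, so if an md file renders incorrectly, open it with a Markdown extension in VS Code.☆76Dec 19, 2025Updated 2 months ago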
- ☆29Apr 22, 2024Updated last year
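- 🔥🔥🔥 A modular news recommendation framework based on PyTorch Lightning and the MIND dataset. It implements the full pipeline from feature engineering to recall (DSSM) and ranking (Deep, DCN, WideDeep, FM).☆42Updated this week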
- 爱上美食, a released recipe app available for download from Android app markets, built with Retrofit, Glide, and Gson☆24Sep 7, 2018Updated 7 years ago
- LangGraph agent template with MCP.☆30Apr 8, 2025Updated 10 months ago
- LIQUID: A Framework for List Question Answering Dataset Generation (AAAI 2023)☆28Jun 7, 2023Updated 2 years ago
- ☆27Dec 12, 2024Updated last year
- This project uses multimodal models, RAG, and LLM techniques to build a palm-reading fortune-telling system☆30Aug 28, 2024Updated last year
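- Collected course materials and assignments for the Key Software track at Peking University's School of Software and Microelectronics (Operating Systems and Virtualization, Deep Learning Technology and Applications, etc.)☆34Sep 8, 2024Updated last year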