pihang/LLM_Learning_ph

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pihang/LLM_Learning_ph)

pihang / LLM_Learning_ph

从零预训练LLM、SFT、RLHF、DPO笔记整理+面试问题

☆21

Alternatives and similar repositories for LLM_Learning_ph

Users that are interested in LLM_Learning_ph are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WanderJN / llm_scratch
View on GitHub
[原理解析] 大模型基本功（手撕Transformer模型、手撕PPO、GRPO、DPO训练器）
☆29Jul 8, 2025Updated last year
101yang101 / CZY_ChatBot
View on GitHub
本项目是一个基于LangChain构建的多Agent系统，结合Streamlit实现的Web界面，能够根据用户输入进行网络搜索并提供旅游相关的聊天服务。此外，该系统还具备基于本地知识库的推销功能，为用户提供个性化的旅游产品推荐。
☆15Apr 20, 2025Updated last year
sober-clever / ReRe
View on GitHub
The implementations of paper "Reinforced Preference Optimization for Recommendation" (ReRe).
☆20Nov 16, 2025Updated 8 months ago
ZhangHaoyang493 / News_Recsys
View on GitHub
🔥🔥🔥 基于 PyTorch Lightning 和 MIND 数据集的模块化新闻推荐系统框架。实现了从特征工程到召回 (DSSM) 与排序 (Deep, DCN, WideDeep, FM) 的完整链路。
☆49Apr 12, 2026Updated 3 months ago
adobe-research / pdftriage
View on GitHub
☆16Oct 6, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
TiantianZhu110 / BioPRO
View on GitHub
☆12Jun 23, 2023Updated 3 years ago
NJUxlj / Chinese-MedQA-Qwen2
View on GitHub
基于Qwen2+SFT+DPO的医疗问答系统，项目中使用了自定义的 SFTTrainer/DPOTrainer/TRPOTrainer用于训练，其次，项目还调用各种知识库工具（neo4j, milvus, LDA, 等）进行自动化训练数据生成。另外，使用 vllm 用于推理…
☆89Apr 29, 2026Updated 2 months ago
ocastel / exact-extract
View on GitHub
☆12Sep 2, 2021Updated 4 years ago
qiufengqijun / open-r1-reprod
View on GitHub
这是一个open-r1的复现项目，对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练，观察到一些有趣的现象。
☆64Apr 13, 2025Updated last year
lzhangbv / acpsgd
View on GitHub
[ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning
☆10Apr 28, 2023Updated 3 years ago
zhanghaok / BERT-MRC-NER
View on GitHub
基于BERT-MRC（阅读理解）的命名实体识别模型
☆20Mar 15, 2022Updated 4 years ago
BUAADreamer / llmkiller
View on GitHub
LLM手撕代码合集
☆23Mar 25, 2025Updated last year
fdalvi / analyzing-redundancy-in-pretrained-transformer-models
View on GitHub
Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020
☆14Oct 6, 2020Updated 5 years ago
zhangzg1 / rag-llm
View on GitHub
基于大语言模型的RAG项目，分别实现了基于文本和知识图谱的RAG
☆28Jul 3, 2026Updated 2 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
singularguy / CircuitManus
View on GitHub
☆123May 29, 2025Updated last year
Chrissie-Law / Causal-Domain-Clustering-for-Multi-Domain-Recommendation
View on GitHub
Official code for the SIGIR 2025 accepted paper "CDC: Causal Domain Clustering for Multi-Domain Recommendation".
☆15Aug 27, 2025Updated 10 months ago
SUSTech-HPCLab / CS305-2023Spring-Project
View on GitHub
☆11May 27, 2023Updated 3 years ago
k1l1 / CoCoFL
View on GitHub
CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization
☆13Aug 3, 2024Updated last year
pamaforce / OneKE-RAG
View on GitHub
基于 OneKE 的知识图谱构建与 RAG 问答系统搭建
☆27Jun 29, 2024Updated 2 years ago
LeeZChuan / DTVis-master
View on GitHub
DTVis：交通流量时空演变特征可视分析；数据源：2019CCF BDCI-可视化大赛
☆17Jul 15, 2021Updated 5 years ago
HITsz-TMG / Hansel
View on GitHub
Code and data of WSDM 2023 paper "Hansel: A Chinese Few-Shot and Zero-Shot Entity Linking Benchmark".
☆24Jun 1, 2023Updated 3 years ago
DXWEIE / ccks2025_pdf_multimodal
View on GitHub
My implementation in TianChi CCKS 2025 pdf QA multimodal competition
☆19Aug 27, 2025Updated 10 months ago
rainstorm12 / KG-RAG
View on GitHub
简单实现了一下基于知识图谱和文本文档联合做检索增强(RAG)大模型的实现，这里采用的数据分别是管廊维护领域的文本文档和专家知识图谱
☆24Jun 6, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
njxzc-ycx / BERT_PCNN-relation-extraction
View on GitHub
Bert + PCNN and PCNN 中文关系抽取任务
☆20Dec 30, 2022Updated 3 years ago
gzhuuser / fortune_teller
View on GitHub
本项目将基于多模态,RAG以及LLM等技术，打造了一个基于手相算命的系统
☆30Aug 28, 2024Updated last year
GerlinGreen / OneIE
View on GitHub
Forked from *OneIE: A Joint Neural Model for Information Extraction with Global Features*
☆21Sep 4, 2022Updated 3 years ago
wangruns / Roommate-Recommender-System
View on GitHub
2018年研究生室友推荐系统——Roommate Matching——简单小应用帮助同学寻找习性相同的室友
☆11Apr 3, 2019Updated 7 years ago
nju-websoft / SpanQualifier
View on GitHub
☆11Feb 21, 2024Updated 2 years ago
boschmitt / wishbone
View on GitHub
VHDL Implementation
☆15Oct 9, 2014Updated 11 years ago
GasolSun36 / NanoRAG
View on GitHub
Simple implementation of Retrieval-Augmented Generation System
☆28Oct 24, 2024Updated last year
173787247 / intelligent-customer-service
View on GitHub
智能客服系统架构与多Agent协作
☆29Oct 9, 2025Updated 9 months ago
DoctorKey / Practise
View on GitHub
[CVPR2023] Practical Network Acceleration with Tiny Sets
☆13Jul 28, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
xdong97 / TCMPR
View on GitHub
Source code and datasets for paper "TCMPR: TCM Prescription recommendation based on subnetwork term mapping and deep learning"
☆27Jan 10, 2023Updated 3 years ago
dgliu / WSDM24_MultiFS
View on GitHub
Experiments codes for WSDM '24 paper "MultiFS: Automated Multi-Scenario Feature Selection in Deep Recommender Systems"
☆11May 31, 2024Updated 2 years ago
Sherrylife / FedLMT
View on GitHub
[ICML2024] "FedLMT: Tackling System Heterogeneity of Federated Learning via Low-Rank Model Training with Theoretical Guarantees" by Jiaha…
☆14Sep 22, 2024Updated last year
IgorSokoloff / rr_with_compression_experiments_source_code
View on GitHub
Q-RR, DIANA-RR, Q-NASTYA, NASTYA-DIANA, QSGD, DIANA, FedCOM and FedPAQ on logistic loss with L2 regularization
☆11Nov 1, 2022Updated 3 years ago
lancopku / MUKI
View on GitHub
[Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
☆19Mar 16, 2023Updated 3 years ago
percent4 / R-BERT_for_people_relation_extraction
View on GitHub
使用R-BERT模型对人物关系模型进行分类，效果有显著提升。
☆24Mar 22, 2023Updated 3 years ago
selkerdawy / FTWT
View on GitHub
Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction
☆10May 25, 2022Updated 4 years ago