AIDajiangtang/LLM-from-scratch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AIDajiangtang/LLM-from-scratch)

AIDajiangtang / LLM-from-scratch

从零开始学大模型Transformer、GPT2、BERT pre-training and fine-tuning from scratch

☆41

Alternatives and similar repositories for LLM-from-scratch

Users that are interested in LLM-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hellangleZ / hospital_multiagent_system
View on GitHub
医疗问诊系统multi-agent框架
☆100Mar 30, 2025Updated last year
taishan1994 / classical_chinese_extraction
View on GitHub
文言文信息抽取（实体识别+关系抽取）
☆10Feb 24, 2023Updated 3 years ago
twosigma / beaker-notebook-archive
View on GitHub
Archive of Beaker Notebook
☆12May 9, 2017Updated 9 years ago
ZhuChaoY / SDKG-11
View on GitHub
Multimodal reasoning based on knowledge graph embedding for specific diseases
☆13Apr 29, 2024Updated 2 years ago
gsalaz98 / cinnamon_roll
View on GitHub
Limit Orderbook Replay/Analysis Library
☆10Nov 19, 2018Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
luka-group / NLI_as_Indirect_Supervision
View on GitHub
Official Implementation for "Can NLI Provide Proper Indirect Supervision for Low-resource Biomedical Relation Extraction?" ACL 2023
☆15Jun 17, 2023Updated 3 years ago
wasiahmad / GATE
View on GitHub
Official implementation of our work, GATE: Graph Attention Transformer Encoder for Cross-lingual Relation and Event Extraction [AAAI 2021…
☆50Apr 16, 2023Updated 3 years ago
huan / tensorflow-handbook-javascript
View on GitHub
TensorFlow Handbook for JavaScript/TypeScript
☆20Nov 16, 2023Updated 2 years ago
youyou22222 / mobilenet-inference-code-tflite
View on GitHub
MobileNet-v1 tensorflow lite inference code written in c++
☆13Feb 13, 2019Updated 7 years ago
mlwithme / TransformerClassification
View on GitHub
A Transformer Framework Based Classification Task
☆26Feb 26, 2024Updated 2 years ago
THU-KEG / Event-Level-Knowledge-Editing
View on GitHub
☆12Apr 25, 2024Updated 2 years ago
kokororin / typecho-theme-Twitter
View on GitHub
A twitter liked typecho theme.
☆11Sep 29, 2015Updated 10 years ago
shaozhipeng / flink-quickstart
View on GitHub
Flink Streaming SQL | FlinkCEP | Some demos and notes
☆20Sep 22, 2020Updated 5 years ago
cshmzin / nlp-code
View on GitHub
nlp codes for study
☆18Mar 30, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
myhloli / Magic-PDF
View on GitHub
☆23Updated this week
taishan1994 / python3_wiki_word2vec
View on GitHub
基于python3训练中文wiki词向量、字向量、拼音向量
☆11Jan 2, 2022Updated 4 years ago
lr580 / shannon_sources
View on GitHub
华南师范大学软件协会香农先修班主要在2021秋季学期和2022春季学期的部分公开资料，包括以我本人为唯一作者或主要作者的课件、算法比赛资料(题面、题解、判题std数据、SPJ)。
☆15Dec 20, 2023Updated 2 years ago
LeonardoBerti00 / Multi-Horizon-Forecasting-for-Limit-Order-Books
View on GitHub
Pytorch implementation of DeepLOB-ATT and DeepLOB-Seq2Seq from Multi Horizon Forecasting for Limit Order Books
☆14Feb 4, 2023Updated 3 years ago
WKQ9411 / Mini-LLM
View on GitHub
This project aims to replicate mainstream open-source model architectures with limited computational resources, implementing mini models …
☆284Jun 14, 2026Updated last month
msft-vivi / RelationExtract-Pytorch
View on GitHub
Relation Extraction 论文复现
☆48Nov 2, 2019Updated 6 years ago
Robin-WZQ / CBLUE_CMeIE_model
View on GitHub
CBLUE2.0-关系抽取模型，基于pytorch
☆18Oct 23, 2024Updated last year
zigslice / pyouter
View on GitHub
router hierarchy and layered tasks, which may come from command line or http restful api
☆16Jul 10, 2025Updated last year
chufangao / TTM-RE
View on GitHub
ACL2024: TTM-RE Memory-Augmented Document-Level Relation Extraction
☆21Oct 6, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Victoria-Pinzhen-Liao / Functional-Programming
View on GitHub
These are my lecture notes and code for Coursera online course Functional Programming Principles in Scala by Prof. Martin Odersky from Éc…
☆20Jan 7, 2024Updated 2 years ago
JunnYu / jy_ner
View on GitHub
softmax, crf, biaffine, globalpointer, efficient globalpointer, ricon
☆17Oct 11, 2022Updated 3 years ago
mingmingyang / auto_spectral_clustering
View on GitHub
auto_spectral_clustering predicts number of clusters based on the eigengap.
☆12Mar 31, 2021Updated 5 years ago
kokororin / Kotori.php
View on GitHub
A Tiny Model-View-Controller(MVC) PHP Framework
☆11Dec 31, 2021Updated 4 years ago
qq1065507891 / ChineseAddressNER
View on GitHub
阿里中文地址要素解析
☆17Jun 10, 2022Updated 4 years ago
huiling-y / EventGraph
View on GitHub
☆16Oct 31, 2022Updated 3 years ago
KaihuaTang / Building-a-Small-LLM-from-Scratch
View on GitHub
该系列的目的是让读者可以在基础的pytorch上，不依赖任何其他现成的外部库，从零开始理解并实现一个大语言模型的所有组成部分，以及训练微调代码，因此读者仅需python，pytorch和最基础深度学习背景知识即可。
☆385Aug 28, 2025Updated 11 months ago
Event-AHU / Time_Series_Analysis
View on GitHub
time series data, pre-training, 1d signal, fusion
☆16Mar 6, 2026Updated 4 months ago
kesenzhao / DT4Rec
View on GitHub
☆21May 22, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ssw-nlp-study-group / nlp_study
View on GitHub
☆15Feb 8, 2023Updated 3 years ago
Enter-tainer / LuoguPaintAutomaton
View on GitHub
luogu冬日画板自动绘图脚本
☆11Dec 31, 2018Updated 7 years ago
CarlanLark / IPGPF
View on GitHub
Code for EMNLP 2023 long paper: An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extracti…
☆19Feb 2, 2025Updated last year
EleoXDA / Socialize_RB
View on GitHub
A place to find developers nearby for chat or meet
☆11Sep 21, 2023Updated 2 years ago
Tiga001 / learn-LLM-with-LLM
View on GitHub
LLM teach people something about LLM
☆19Mar 14, 2026Updated 4 months ago
StreamAI / C-Concurrence
View on GitHub
thread and atomic library
☆29Apr 11, 2020Updated 6 years ago
wencyxu / IRF-LLM-accepted-at-WWW25-
View on GitHub
Accepted at WWW 25 Industrial Track (oral)
☆18Jun 6, 2025Updated last year