从零开始学大模型Transformer、GPT2、BERT pre-training and fine-tuning from scratch
☆37Jul 1, 2024Updated last year
Alternatives and similar repositories for LLM-from-scratch
Users that are interested in LLM-from-scratch are comparing it to the libraries listed below
Sorting:
- 一个手把手教你从零开始编写GPT并训练大语言模型的教程☆96Jan 20, 2025Updated last year
- 华南师范大学软件协会香农先修班主要在2021秋季学期和2022春季学期的部分公开资料,包括以我本人为唯一作者或主要作者的课件、算法比赛资料(题面、题解、判题std数据、SPJ)。☆15Dec 20, 2023Updated 2 years ago
- Archive of Beaker Notebook☆12May 9, 2017Updated 8 years ago
- These are my lecture notes and code for Coursera online course Functional Programming Principles in Scala by Prof. Martin Odersky from Éc…☆21Jan 7, 2024Updated 2 years ago
- USTC Principles and Techniques of Compiler 2023 homepage☆27Oct 22, 2024Updated last year
- Implementation of PCA algorithm using Gram-Scmidt modification on NIPALS☆10Jun 13, 2015Updated 10 years ago
- Accepted at WWW 25 Industrial Track (oral)☆18Jun 6, 2025Updated 9 months ago
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- 小鸡词典🐤的Alfred🎩插件 咯咯咯☆11Apr 19, 2023Updated 2 years ago
- NCSU GIS 595-601: Tools for open geospatial science☆16Nov 10, 2025Updated 3 months ago
- ☆41Jan 5, 2022Updated 4 years ago
- Simple and efficient tools for data science.☆12May 17, 2024Updated last year
- ☆11May 6, 2016Updated 9 years ago
- 使用biaffine的中文命名实体识别☆10Jan 12, 2023Updated 3 years ago
- ☆46Oct 17, 2025Updated 4 months ago
- Simple clock/cron process that monitors a specific directory and run jobs based on its filename.☆10Jun 8, 2020Updated 5 years ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Jan 21, 2021Updated 5 years ago
- ☆17May 8, 2014Updated 11 years ago
- Here I show how to use Deep Learning for biological and biomedical Data Integration.☆11Sep 17, 2020Updated 5 years ago
- Jupyterlab extension containing a UI for debugging☆10Dec 2, 2019Updated 6 years ago
- Quick and dirty debugging☆17Nov 13, 2025Updated 3 months ago
- Creating user interfaces for data science with Jupyter widgets☆11Oct 28, 2017Updated 8 years ago
- Python常用代码段☆11Sep 8, 2021Updated 4 years ago
- open source code for Atstake☆10Nov 11, 2020Updated 5 years ago
- This is the public repository of AAAI 2024 paper "Is a Large Language Model a Good Annotator for Event Extraction"☆10Feb 16, 2024Updated 2 years ago
- experiments with the R package TSclust☆11Mar 5, 2015Updated 11 years ago
- ☆10Jan 28, 2021Updated 5 years ago
- ☆12Mar 18, 2024Updated last year
- Project Based Learning Rust☆11Jan 16, 2024Updated 2 years ago
- Limit Orderbook Replay/Analysis Library☆10Nov 19, 2018Updated 7 years ago
- ☆58Feb 7, 2025Updated last year
- Deploy Pytorch models to production via panini☆10Mar 18, 2019Updated 6 years ago
- Python APIs for perspective front end☆15Jun 1, 2022Updated 3 years ago
- ☆12Apr 27, 2025Updated 10 months ago
- This is a tool to delete the remaining dependencies and cache files in the development environment, eg: nodule_modules、target...☆11Jul 22, 2024Updated last year
- Colab notebooks from Launch Data Science at HackCville☆13Jun 14, 2019Updated 6 years ago
- Code for "A Bilingual Generative Transformer for Semantic Sentence Embedding" published at EMNLP 2020.☆10Nov 20, 2020Updated 5 years ago
- ☆12Apr 25, 2024Updated last year
- Code of the Grounded MUIE model, REAMO☆11Dec 3, 2024Updated last year