llm-scratch-pytorch - The code is designed to be beginner-friendly, with a focus on understanding the fundamentals of PyTorch and implementing LLMs from scratch, step by step.
☆100 · Jan 27, 2026 · Updated 2 months ago
Alternatives and similar repositories for llm-scratch-pytorch
Users that are interested in llm-scratch-pytorch are comparing it to the libraries listed below.
- ☆14 · Jun 20, 2022 · Updated 3 years ago
- Compiler_Principle ☆11 · Jun 5, 2015 · Updated 10 years ago
- 老面 (sourdough, lit. "old dough") is used as the starter for fermenting dough. This repository contains my digital 老面. ☆12 · Sep 23, 2025 · Updated 6 months ago
- This repository contains (I hope) useful text data from the Federal Reserve. For people interested in text analysis, I have scraped the t… ☆10 · Apr 24, 2022 · Updated 3 years ago
- I, Chen Ping'an, with just one key, can move mountains, overturn seas, subdue demons, suppress devils, command gods, pluck stars, sever rivers, topple cities, and split the heavens! ☆22 · Jun 4, 2022 · Updated 3 years ago
- ☆18 · Nov 19, 2022 · Updated 3 years ago
- KAFKA 100-Day Challenge ☆14 · Jan 13, 2026 · Updated 3 months ago
- Course resources for the University of Science and Technology of China (USTC) ☆12 · Mar 8, 2019 · Updated 7 years ago
- SLAM algorithm ☆17 · May 30, 2023 · Updated 2 years ago
- Feeling confused about super alignment? Here is a reading list ☆43 · Jan 9, 2024 · Updated 2 years ago
- Archiving project for the journal 《二十一世紀》 (Twenty-First Century) ☆38 · Aug 1, 2025 · Updated 8 months ago
- JavaScript code notes! ☆24 · Apr 27, 2022 · Updated 3 years ago
- ☆31 · Nov 23, 2021 · Updated 4 years ago
- A GitHub client to help you find interesting projects. ☆52 · Jan 26, 2026 · Updated 2 months ago
- ☆48 · Jan 21, 2025 · Updated last year
- 翠翠的友链屋 (Cuicui's Friend-Link House) - aggregates blog posts from friend links via RSS ☆47 · Mar 22, 2026 · Updated 3 weeks ago
- ☆97 · Aug 6, 2022 · Updated 3 years ago
- From DIY performance art to DIY Socratic dialogue, from DIY-ing a ritual to DIY-ing skipping a class: an encyclopedia of guides to all kinds of activities. diy💔 is a non-code open-source project incubated by 706. ☆55 · May 21, 2022 · Updated 3 years ago
- Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training ☆132 · Apr 17, 2024 · Updated last year
- A mini version of Slack (react, redux, ts, firebase, styled-components, vite, pnpm) ☆61 · Aug 29, 2022 · Updated 3 years ago
- Elasticsearch with BERT for advanced document search. ☆898 · May 1, 2023 · Updated 2 years ago
- Blogs of students and alumni, collected by the Nanjing University Linux Users Group ☆77 · Mar 20, 2026 · Updated 3 weeks ago
- Flask-related job opportunities. ☆77 · Jan 5, 2022 · Updated 4 years ago
- Simple experiments with the P-tuning method on Chinese ☆140 · Apr 8, 2021 · Updated 5 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training ☆152 · Jun 10, 2022 · Updated 3 years ago
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch. ☆184 · Aug 12, 2020 · Updated 5 years ago
- https://acl2023-retrieval-lm.github.io/ ☆156 · Oct 18, 2023 · Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆210 · Jan 13, 2024 · Updated 2 years ago
- Graceful exit when `uncaughtException` is emitted, based on `process.on('uncaughtException')`. ☆250 · Dec 15, 2024 · Updated last year
- Paper list for open-ended language generation ☆191 · Nov 17, 2022 · Updated 3 years ago
- CN MonaD.ReadeR Reading Group ☆107 · Mar 3, 2016 · Updated 10 years ago
- An SRE (Site Reliability Engineer) tutorial that is all-encompassing yet well organized, seasoned by years of experience yet very much up to date. ☆138 · Oct 9, 2023 · Updated 2 years ago
- MD5 links for a Chinese book corpus ☆217 · Jan 31, 2024 · Updated 2 years ago
- DataCLUE: a data-centric NLP benchmark and toolkit ☆144 · May 11, 2022 · Updated 3 years ago
- A simple tool to import/export your book notes ☆159 · Jul 27, 2023 · Updated 2 years ago
- Awesome papers for role-playing with language models ☆220 · Nov 3, 2024 · Updated last year
- Datasets for Instruction Tuning of Large Language Models ☆261 · Nov 30, 2023 · Updated 2 years ago
- Question Answering using Albert and Electra ☆208 · Jun 12, 2023 · Updated 2 years ago
- Awesome point cloud processing algorithms ☆131 · May 30, 2023 · Updated 2 years ago