A hands-on tutorial that teaches you to write a GPT from scratch and train a large language model
☆96Jan 20, 2025Updated last year
Alternatives and similar repositories for ScratchLLMStepByStep
Users that are interested in ScratchLLMStepByStep are comparing it to the libraries listed below.
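- This series aims to let readers understand and implement, from scratch, every component of a large language model along with its training and fine-tuning code, using only basic PyTorch and no other off-the-shelf external libraries. Readers need only Python, PyTorch, and the most basic deep learning background.☆385Aug 28, 2025Updated 7 months ago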
- ☆15Apr 23, 2025Updated 11 months ago
- Building DeepSeek R1 from Scratch☆751Mar 21, 2025Updated last year
- Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe…☆13Jan 29, 2025Updated last year
- ☆21Mar 1, 2025Updated last year
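- This project aims to use LangChain and large language models (such as ZhipuAI) to build an intelligent database question-answering system. The system interprets the user's natural-language query, automatically generates and executes the corresponding SQL statement, and returns the query result to the user in natural language.☆17Jul 31, 2024Updated last year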
- Exploring Applications of GRPO☆252Aug 25, 2025Updated 7 months ago
- Programming with Go and large language models☆44Mar 5, 2025Updated last year
- PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model☆28Oct 10, 2024Updated last year
- Sequences from Adaptyv Bio’s EGFR Protein Design Competition☆15Aug 28, 2025Updated 7 months ago
- Explore and express your inner voice through personalized conversations with HeartEcho, a platform dedicated to understanding and evolvin…☆26Aug 16, 2024Updated last year
- Scalable Kubernetes-native implementation of the Open Data Fabric protocol for global collaborative data processing☆22Mar 19, 2026Updated last week
- GEMM☆10Aug 26, 2023Updated 2 years ago
- Minimize server usage by leveraging a decentralized peer-to-peer network for ultra-low-latency live streaming among users.☆13Feb 19, 2024Updated 2 years ago
- ☆63Feb 15, 2026Updated last month
- A collection of middlewares for Socket.IO☆21Jun 5, 2014Updated 11 years ago
- [NeurIPS 2025] Reward-Instruct: A Reward-Centric Approach to Fast Photo-Realistic Image Generation☆35Oct 24, 2025Updated 5 months ago
- ☆15Jun 22, 2025Updated 9 months ago
- Run-length encoding utils for Go☆13May 8, 2018Updated 7 years ago
- Official repository for "Plug & Play Directed Evolution for Proteins with Gradient-Based Discrete MCMC"☆12Jul 18, 2023Updated 2 years ago
- A simple, cross-platform RAG framework and tutorial☆230Jan 17, 2026Updated 2 months ago
- A project of fault localization in time series data☆12Apr 18, 2019Updated 6 years ago
- Messenger provides a simple arbitrary message sending API to multiple peers for libp2p-based protocols.☆20Mar 17, 2026Updated last week
- Web app for deploying airdrops with claim links☆11Dec 8, 2022Updated 3 years ago
- Distributed hash-table node☆13Oct 2, 2023Updated 2 years ago
- Cute layout visualization☆33Jan 18, 2026Updated 2 months ago
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 7 months ago
- ☆15Mar 7, 2019Updated 7 years ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆629Feb 24, 2025Updated last year
- Deepen your understanding of the Transformer model through a guided walkthrough of it☆240Jun 3, 2025Updated 9 months ago
- Quickly learn the SwiftUI development process by building a simple résumé app.☆43Apr 25, 2022Updated 3 years ago
- Companion code for the book 《汇编语言一发入魂》☆15May 30, 2020Updated 5 years ago
- Publish dynamic multiaddresses of private or isolated nodes using IPNS. Benefit - 1. Bandwidth savings, 2. Avoiding DDNS 3. Securely expo…☆14Nov 11, 2021Updated 4 years ago
- All Resources from Stanford CS106B 2021☆24Jul 11, 2025Updated 8 months ago
- "TILT: Transform Invariant Low-rank Textures" CPP port.☆22Jun 25, 2021Updated 4 years ago
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆164Nov 25, 2025Updated 4 months ago
- A distributed hash table from scratch☆12Sep 19, 2017Updated 8 years ago
- 2017 Industrial Big Data Innovation Competition / Wind Turbine Blade Icing Prediction Contest☆48Nov 15, 2018Updated 7 years ago