一个手把手教你从零开始编写GPT并训练大语言模型的教程
☆101Jan 20, 2025Updated last year
Alternatives and similar repositories for ScratchLLMStepByStep
Users that are interested in ScratchLLMStepByStep are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 该系列的目的是让读者可以在基础的pytorch上,不依赖任何其他现成的外部库,从零开始理解并实现一个大语言模型的所有组成部分,以及训练微调代码,因此读者仅需python,pytorch和最基础深度学习背景知识即可。☆385Aug 28, 2025Updated 9 months ago
- ☆15Apr 23, 2025Updated last year
- Building DeepSeek R1 from Scratch☆769Mar 21, 2025Updated last year
- Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe…☆13Jan 29, 2025Updated last year
- a lab from ruc base☆12Jan 24, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆34Jul 8, 2025Updated 11 months ago
- Exploring Applications of GRPO☆253Aug 25, 2025Updated 9 months ago
- Go和大语言模型编程☆44Mar 5, 2025Updated last year
- Assignments of Physical Design for Nanometer ICs (Spring 2017, Prof. Yao-Wen Chang)☆46Dec 24, 2018Updated 7 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- go-namesys provides publish and resolution support for the /ipns/ namespace in go-ipfs☆14Jun 14, 2023Updated 2 years ago
- ☆11Oct 29, 2022Updated 3 years ago
- Wrap an io.Writer for metrics.☆10May 8, 2018Updated 8 years ago
- Introspected tunnels to localhost☆10Apr 18, 2017Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- GEMM☆10Aug 26, 2023Updated 2 years ago
- ☆10May 23, 2022Updated 4 years ago
- useful cuda code .☆43Mar 11, 2022Updated 4 years ago
- Distributed KV store using go-ds-crdt and libp2p☆12Nov 28, 2021Updated 4 years ago
- Fast and Flexible FPGA development using Hierarchical Partial Reconfiguration (FPT 2022)☆15Mar 21, 2024Updated 2 years ago
- Minimize server usage by leveraging a decentralized peer-to-peer network for ultra-low-latency live streaming among users.☆13Feb 19, 2024Updated 2 years ago
- ☆66Feb 15, 2026Updated 3 months ago
- rust implementation fo the DHT powering the HyperSwarm stack☆18Apr 1, 2022Updated 4 years ago
- go-libp2p's TLS encrypted transport☆16May 25, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Ada library and tools for the analysis of Complex Networks and more☆11Nov 16, 2023Updated 2 years ago
- ☆11May 16, 2026Updated 3 weeks ago
- ☆15Apr 23, 2026Updated last month
- AI 应用开发工程师面试宝典 - 二狗子整理☆181Jun 3, 2026Updated last week
- A simple and trans-platform rag framework and tutorial☆232Jan 17, 2026Updated 4 months ago
- 博客代码:快过年了,搞个AI作曲,用TensorFlow训练midi文件☆17Dec 24, 2022Updated 3 years ago
- ☆13Sep 25, 2021Updated 4 years ago
- ☆18Jan 25, 2025Updated last year
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆33May 26, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- corundum work on vu13p☆23Nov 10, 2023Updated 2 years ago
- Web app for deploying airdrops with claim links☆11Dec 8, 2022Updated 3 years ago
- Decentralized IPFS Pinning Service AVS☆13Oct 28, 2024Updated last year
- Llama3 Streaming Chat Sample☆22Apr 24, 2024Updated 2 years ago
- A reinforcement learning object detector leveraging saliency ranking, offering a self-explainable system with a fully observable action l…☆14May 28, 2025Updated last year
- ☆14Nov 3, 2025Updated 7 months ago
- Train toy models using multi-token prediction objective☆14Apr 18, 2026Updated last month