A hands-on, step-by-step tutorial that teaches you to write a GPT from scratch and train a large language model
☆99Jan 20, 2025Updated last year
Alternatives and similar repositories for ScratchLLMStepByStep
Users that are interested in ScratchLLMStepByStep are comparing it to the libraries listed below.
- Learn large models from scratch: Transformer, GPT2, and BERT pre-training and fine-tuning☆40Jul 1, 2024Updated last year
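- The aim of this series is to let readers understand and implement, from scratch and in basic PyTorch without relying on any other off-the-shelf external libraries, every component of a large language model, along with the training and fine-tuning code. Readers therefore need only Python, PyTorch, and the most basic deep learning background.☆387Aug 28, 2025Updated 8 months ago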
- Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe…☆13Jan 29, 2025Updated last year
- ☆22Mar 1, 2025Updated last year
- Huawei collective communication performance tests☆16May 27, 2024Updated last year
- a lab from ruc base☆13Jan 24, 2023Updated 3 years ago
- The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading t…☆28Jan 20, 2026Updated 3 months ago
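- This project aims to build an intelligent database question-answering system using LangChain and large language models (such as ZhipuAI). The system interprets the user's natural-language query, automatically generates and executes the corresponding SQL statement, and returns the query results to the user in natural language.☆16Jul 31, 2024Updated last year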
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Apr 13, 2026Updated last month
- ☆19Apr 11, 2024Updated 2 years ago
- ☆34Jul 8, 2025Updated 10 months ago
- Source and solution codes for Professional CUDA C Programming book.☆15Aug 20, 2020Updated 5 years ago
- Go and large language model programming☆44Mar 5, 2025Updated last year
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b.☆17Feb 19, 2025Updated last year
- Demo on how to write your own `kubectl exec` command with client-go☆13May 15, 2017Updated 9 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 3 months ago
- Kexplain is an interactive kubectl explain☆12Oct 23, 2023Updated 2 years ago
- go-namesys provides publish and resolution support for the /ipns/ namespace in go-ipfs☆14Jun 14, 2023Updated 2 years ago
- ☆11Oct 29, 2022Updated 3 years ago
- ☆11Dec 29, 2020Updated 5 years ago
- Scalable Kubernetes-native implementation of the Open Data Fabric protocol for global collaborative data processing☆23May 11, 2026Updated last week
- Distributed storage system based on the Raft consensus protocol, implemented from scratch in Java with reference to Alibaba SOFAJRaft.☆20Dec 14, 2022Updated 3 years ago
- GEMM☆10Aug 26, 2023Updated 2 years ago
- ☆10May 23, 2022Updated 3 years ago
- Useful CUDA code.☆43Mar 11, 2022Updated 4 years ago
- Official implementation of the papers "User-controlled federated matrix factorization for recommender systems" and "FedeRank: User Contro…☆18Jul 28, 2020Updated 5 years ago
- Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection☆13Jun 17, 2025Updated 11 months ago
- A collection of middlewares for Socket.IO☆21Jun 5, 2014Updated 11 years ago
- go-libp2p's TLS encrypted transport☆16May 25, 2022Updated 3 years ago
- Papers related to the Recommender System from SIGIR 2021 (including the links for Paper PDF, Github Code and Dataset)☆24Jun 9, 2021Updated 4 years ago
- Build a large language model (LLM) from scratch☆22Dec 19, 2024Updated last year
- ☆11Sep 21, 2022Updated 3 years ago
- Resume modified by Later☆13Apr 8, 2021Updated 5 years ago
- ☆15Apr 23, 2026Updated 3 weeks ago
- Imageflow is a Raycast extension that allows you to process images using a customizable workflow. You can resize, compress, and convert i…☆13Apr 9, 2025Updated last year
- The scheduler of Volcano, built based on kubernetes-sigs/kube-batch☆14Jul 7, 2019Updated 6 years ago
- A simple and trans-platform rag framework and tutorial☆232Jan 17, 2026Updated 4 months ago
- Blog code: with the New Year approaching, an AI music composition project that trains on MIDI files with TensorFlow☆17Dec 24, 2022Updated 3 years ago
- A repository for all Signal-related Docker containers☆17Oct 4, 2024Updated last year