fangpin/llm-from-scratch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fangpin/llm-from-scratch)

fangpin / llm-from-scratch

Build LLM from scratch

☆123

Alternatives and similar repositories for llm-from-scratch

Users that are interested in llm-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JJXiangJiaoJun / cutlass_gemv
View on GitHub
GEMV implementation with CUTLASS
☆21Aug 21, 2025Updated 10 months ago
cherichy / tilecute
View on GitHub
☆32Jul 2, 2025Updated last year
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆34Jul 29, 2025Updated 11 months ago
HPMLL / NVIDIA-Hopper-Benchmark
View on GitHub
☆110May 31, 2025Updated last year
HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated 5 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
wangleiofficial / FAPEloss
View on GitHub
alphafold FAPE loss
☆10Sep 28, 2021Updated 4 years ago
ademeure / cuda-side-boost
View on GitHub
☆60Feb 24, 2026Updated 4 months ago
ZhangZhiPku / cutile-examples
View on GitHub
cutile kernel examples
☆50Apr 3, 2026Updated 3 months ago
JF-D / Proteus
View on GitHub
☆24Jul 7, 2024Updated 2 years ago
Wuyxin / GraphMETRO
View on GitHub
GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts (NeurIPS 2024)
☆29Mar 1, 2025Updated last year
LirongWu / GraphMixup
View on GitHub
Code for ECML-PKDD 2022 paper "GraphMixup: Improving Class-Imbalanced Node Classification by Reinforcement Mixup and Self-supervised Cont…
☆24Jun 7, 2023Updated 3 years ago
OpenMOSS / Embodied-Planner-R1
View on GitHub
Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning
☆27Mar 30, 2026Updated 3 months ago
IST-DASLab / Quartet-II
View on GitHub
Quartet II Official Code
☆76May 1, 2026Updated 2 months ago
KuangjuX / NVSHMEM-Tutorial
View on GitHub
NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer
☆193Feb 11, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Cambricon / torch_mlu
View on GitHub
☆55Mar 15, 2025Updated last year
zhen8838 / handson-polyhedral
View on GitHub
tutorials about polyhedral compilation.
☆66Jun 6, 2026Updated last month
HKU-MMLab / EVATok
View on GitHub
[CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"
☆59Mar 13, 2026Updated 3 months ago
neighthan / gpu-utils
View on GitHub
Utility functions/scripts for working with GPUs.
☆10Jul 5, 2021Updated 5 years ago
phonism / genesis
View on GitHub
Gensis is a lightweight deep learning framework written from scratch in Python, with Triton as its backend for high-performance computing…
☆35Jan 15, 2026Updated 5 months ago
DeepLink-org / dlinfer
View on GitHub
☆74Updated this week
chanchimin / AgentMonitor
View on GitHub
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13Dec 13, 2024Updated last year
luongthecong123 / fp8-quant-matmul
View on GitHub
Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.
☆19Feb 9, 2026Updated 5 months ago
toyaix / tritonllm
View on GitHub
LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model
☆118Apr 28, 2026Updated 2 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
aasthavar / finetune-evaluate-codestral
View on GitHub
Different approaches for finetuning, evaluating, optimizations for code generation model - codestral
☆11Jun 18, 2024Updated 2 years ago
yui0 / ugemm
View on GitHub
GEMM
☆10Aug 26, 2023Updated 2 years ago
liscustodio / modified_mc33
View on GitHub
Chernyaev’s Marching Cubes 33 is one of the first algorithms intended to preserve the topology of the trilinear interpolant. In this work…
☆14Jul 19, 2013Updated 12 years ago
JinjieNi / OpenMoE2
View on GitHub
The official repo for "OpenMoE 2: Sparse Diffusion Language Models".
☆58Dec 28, 2025Updated 6 months ago
ihavnoid / tg4perfetto
View on GitHub
Simple python library for generating your own perfetto traces for your application. Can be used for both app instrumentation and custom …
☆26Jun 22, 2025Updated last year
tony10101105 / HEAR-2021-NeurIPS-Challenge---NTU-GURA
View on GitHub
☆13Mar 7, 2022Updated 4 years ago
fwgood / gitdrop
View on GitHub
批量删库，取消star
☆12Jan 6, 2021Updated 5 years ago
keith2018 / TinyGPT
View on GitHub
Tiny C++ LLM inference implementation from scratch
☆119Jun 23, 2026Updated 2 weeks ago
yjxiong / WIDER2019FaceDetectionRuntimeContainerExample
View on GitHub
An example docker container for runtime evaluation for the WIDER 2019 challenge track: face detection accuracy and runtime.
☆17Aug 7, 2019Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
gemgotclass / shadowsocks
View on GitHub
shadowsocks
☆11Jun 15, 2019Updated 7 years ago
EfficientLLMSys / MuxServe
View on GitHub
☆15Jun 26, 2024Updated 2 years ago
alibaba / vstyle
View on GitHub
☆33Sep 15, 2025Updated 9 months ago
AlexwellChen / Toy_ML_Framework
View on GitHub
☆11May 16, 2026Updated last month
JimyMa / FuncTs
View on GitHub
[DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning
☆15Jan 13, 2024Updated 2 years ago
sustcsonglin / fla-tilelang
View on GitHub
☆37Mar 7, 2025Updated last year
XIANGLONGYAN / PBS2P
View on GitHub
PyTorch code for our paper "Progressive Binarization with Semi-Structured Pruning for LLMs"
☆13Mar 11, 2026Updated 3 months ago