ZiQiangXie/llm-from-scratch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZiQiangXie/llm-from-scratch)

ZiQiangXie / llm-from-scratch

LLM implementation one matrix multiplication at a time

☆13

Alternatives and similar repositories for llm-from-scratch

Users that are interested in llm-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

P1ayer-1 / Llama-LibTorch
View on GitHub
Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5
☆16Sep 19, 2024Updated last year
barneyhill / minBERT
View on GitHub
A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)
☆12Mar 20, 2023Updated 3 years ago
SuriyaaVijay / Digital-Wellbeing
View on GitHub
Digital Wellbeing for Linux is an open-source project designed to promote healthy digital habits and improve overall well-being. With a f…
☆10Oct 2, 2023Updated 2 years ago
Alex2Yang97 / local-full-stack-deep-research
View on GitHub
A full-stack local deep research application built with LangGraph, supporting multiple LLM providers and search APIs. Powered by FastAPI …
☆15Jun 15, 2025Updated last year
llnl / protein_tune_rl
View on GitHub
Protein design with infilling language models and reinforcement learning — for antibodies and beyond.
☆15Nov 25, 2025Updated 8 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
QunBB / bert-pretraining
View on GitHub
BERT&RoBERTa预训练代码，tensorflow和torch两种版本实现
☆13Feb 8, 2023Updated 3 years ago
XinmingTu / alphagenome
View on GitHub
Implementation of AlphaGenome, Deepmind's updated genomic attention model
☆15Feb 4, 2026Updated 5 months ago
zhanglabtools / CAMEX
View on GitHub
☆15Dec 19, 2025Updated 7 months ago
emdann / sc_target_evidence
View on GitHub
Meta-analysis of drug target evidence in single-cell data
☆17Oct 22, 2024Updated last year
SaskiaFreytag / spatial_brain_cancer
View on GitHub
☆17Sep 16, 2023Updated 2 years ago
shyhirt / AutoDub
View on GitHub
Automatic video translator and dubber using Whisper, XTTS v2 for voice cloning, and Ollama for local LLM translation. Supports 100+ langu…
☆19May 4, 2026Updated 2 months ago
digitalocean / go-ps
View on GitHub
Find, list, and inspect processes from Go (golang).
☆10Feb 4, 2018Updated 8 years ago
aralab-unr / ga-drl-aubo-ara-lab
View on GitHub
This is the code for GA-DRL-Aubo paper
☆15Apr 8, 2022Updated 4 years ago
oxpig / ABB4
View on GitHub
Antibody structure prediction model for sampling conformational ensembles
☆16Apr 14, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Wenyuan-AI4science / AetherCell
View on GitHub
AetherCell is a hierarchical generative framework designed to predict context-specific transcriptomic responses to drugs and genetic pert…
☆20Jul 22, 2026Updated last week
codingforentrepreneurs / Django-CRM
View on GitHub
Learn how to build your own Customer Relationship Manager with Python, Django, Google Auth Platform, Tiger Data, TimescaleDB, and more. B…
☆23Oct 30, 2025Updated 8 months ago
KempnerInstitute / chess-research
View on GitHub
☆11Jun 17, 2024Updated 2 years ago
anishiisc / Build_LLM_from_Scratch
View on GitHub
A notebook based tutorial series on buildling a LLM from scratch
☆27Sep 17, 2024Updated last year
Evanwu1125 / LiteCoT
View on GitHub
☆17Jun 10, 2025Updated last year
MLH / oracle-ghw-ai-ml-week-challenges
View on GitHub
☆14Aug 7, 2024Updated last year
cliffzhou92 / STT
View on GitHub
☆19May 5, 2024Updated 2 years ago
phbradley / ADAPT
View on GitHub
Antigen-receptor Design Against Peptide-MHC Targets
☆21Jan 9, 2026Updated 6 months ago
epang-ucas / Evaluate_LLMs_to_Genes
View on GitHub
☆19May 25, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ShaYeBuHui01 / flash_attention_inference
View on GitHub
Performance of the C++ interface of flash attention and flash attention v2 in large language model (LLM) inference scenarios.
☆15Aug 31, 2023Updated 2 years ago
axeld5 / pali_reason
View on GitHub
Testing paligemma2 finetuning on reasoning dataset
☆18Dec 28, 2024Updated last year
didiforgithub / MetaGPT-AFLow
View on GitHub
AFlow & MathAI
☆18Feb 24, 2025Updated last year
js-lan / competition_codes
View on GitHub
☆10Feb 23, 2021Updated 5 years ago
kono-dada / Ling-Pet
View on GitHub
☆28Oct 2, 2025Updated 9 months ago
caps-tum / mt4g
View on GitHub
Memory Topology for GPUs
☆19Jul 20, 2026Updated last week
HKUSTDial / ChartInsights
View on GitHub
Officical repository for the paper“ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering”(EMN…
☆22Nov 16, 2024Updated last year
risia / CUDA-SPICE-Circuit-Sim
View on GitHub
☆17Dec 10, 2018Updated 7 years ago
jordan-g / PyTorch-cuDNN-Convolution
View on GitHub
PyTorch extension enabling direct access to cuDNN-accelerated C++ convolution functions.
☆13Mar 14, 2021Updated 5 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
aeilot / hexo-theme-paperwhite
View on GitHub
A minimalist theme
☆18Jan 18, 2025Updated last year
KuangjuX / cu-x
View on GitHub
🎉My Collections of CUDA Kernels~
☆11Jun 25, 2024Updated 2 years ago
kq-chen / qwen-vl-utils
View on GitHub
helper functions for processing and integrating visual language information with Qwen-VL Series Model
☆17Aug 30, 2024Updated last year
FreedomIntelligence / TinyDeepSeek
View on GitHub
Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.
☆30Mar 11, 2025Updated last year
Evanwu1125 / AutoWebWorld
View on GitHub
☆25Jul 10, 2026Updated 2 weeks ago
IST-DASLab / gemm-fp8
View on GitHub
High Performance FP8 GEMM Kernels for SM89 and later GPUs.
☆21Jan 24, 2025Updated last year
p-ranav / container_traits
View on GitHub
Container Traits for Modern C++
☆29Oct 11, 2020Updated 5 years ago