KrishM123 / transformer.cpp

TransformerCPP is a minimal C++ machine learning library with autograd and tensor ops, inspired by PyTorch. It includes a from-scratch Transformer model demo, optimized for CPU and multithreaded performance.

☆19

Alternatives and similar repositories for transformer.cpp:

Users that are interested in transformer.cpp are comparing it to the libraries listed below

smolorg / smolgrad
small auto-grad engine inspired from Karpathy's micrograd and PyTorch
☆268Updated 5 months ago
astledsa / Deep-Learning-C
☆47Updated last month
omkaark / simple-federated-learning
☆99Updated last year
tgautam03 / xGeMM
Accelerated General (FP32) Matrix Multiplication from scratch in CUDA
☆114Updated 3 months ago
tensara / tensara
Competitive GPU kernel optimization platform.
☆60Updated this week
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆46Updated last month
smolorg / smolar
a tiny multidimensional array implementation in C similar to numpy, but only one file.
☆227Updated 9 months ago
ishan0102 / rsrch.space
Stream of my favorite papers and links
☆41Updated last month
joey00072 / Tinytorch
A really tiny autograd engine
☆92Updated last year
unixpickle / learn-ptx
Learning about CUDA by writing PTX code.
☆128Updated last year
pritishmishra703 / Scikit-Learn-From-Scratch
Let's make all Machine learning algorithms from scratch!
☆13Updated 10 months ago
loganwatchorn / notes-pmpp
Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)
☆52Updated 9 months ago
spikedoanz / from-bits-to-intelligence
could we make an ml stack in 100,000 lines of code?
☆42Updated 9 months ago
sankalp1999 / semantweet-search
Vector search over tweets from the tweet archive using OpenAI embeddings and LanceDB
☆53Updated last year
neu-reseau / neupy
a simple numpy alternative in C
☆24Updated 7 months ago
nqureshi / ev-winners
Semantic search over every Emergent Ventures winner.
☆20Updated last month
sincethestudy / fastask
FastAsk is a Python package that installs an easy to use command to your terminal to get a quick answer to a question, using either OpenA…
☆56Updated 4 months ago
seatedro / arxival
☆88Updated 3 months ago
sankalp1999 / twitter-circle
Uses Twitter archive to visualize your Twitter network based on your replies, quote tweets and direct messaging history. Get DM stats wit…
☆129Updated 10 months ago
spikedoanz / tensor-tic-tac-toe
parallelized hyperdimensional tictactoe
☆117Updated 8 months ago
obadakhalili / tinygrad-tensor-puzzles
Solve puzzles to improve your tinygrad skills!
☆123Updated last month
naklecha / arc-agi-attempts
In this repository I have a code and brief explanations of the attempts that I made at the ARC-AGI (2024) challenges :)
☆23Updated 5 months ago
siboehm / ShallowSpeed
Small scale distributed training of sequential deep learning models, built on Numpy and MPI.
☆131Updated last year
biohacker0 / crowRedis
Built Simple memory datastore like redis to learn how it works internally and how databases are built , it has set,get, transactions (mul…
☆30Updated last year
ekzhang / ekzhang.github.io
Source code for my personal website
☆46Updated 3 weeks ago
naklecha / llm-inference-optimizations-explained
in this repository, i'm going to implement increasingly complex llm inference optimizations
☆20Updated this week
snowclipsed / moondream-zig
moondream in zig.
☆65Updated 3 weeks ago
saurabhaloneai / Llama-3-From-Scratch-In-Pure-Jax
This repository contain the simple llama3 implementation in pure jax.
☆63Updated 2 months ago
kanpuriyanawab / picograd
Rust Implementation of micrograd
☆51Updated 10 months ago
Maharshi-Pandya / cudacodes
Learnings and programs related to CUDA
☆380Updated 2 months ago