ulrichstern/cuda-convnet

Alex Krizhevsky's original code from Google Code

☆201

Alternatives and similar repositories for cuda-convnet

Users that are interested in cuda-convnet are comparing it to the libraries listed below.

Red-Rabbit-Robotics / rx1_teleop
☆12 · Sep 25, 2024 · Updated last year
goncalorafaria / qalign
QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.
☆27 · Mar 2, 2026 · Updated 4 months ago
koyeb / tenstorrent-examples
☆19 · Feb 7, 2026 · Updated 5 months ago
lucasdelimanogueira / PyNorch
Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)
☆168 · Nov 25, 2025 · Updated 8 months ago
mueller-mp / maha-norm
☆16 · May 30, 2025 · Updated last year
aquefir / neopolitan
A new city of code on a cosmopolitan foundation.
☆21 · Mar 19, 2021 · Updated 5 years ago
gevtushenko / llm.c
LLM training in simple, raw C/CUDA
☆114 · May 1, 2024 · Updated 2 years ago
clu0 / unet.cu
UNet diffusion model in pure CUDA
☆661 · Jun 28, 2024 · Updated 2 years ago
DjagbleyEmmanuel / llamafile-convert_gguf_UI
This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…
☆14 · Jan 2, 2026 · Updated 6 months ago
karpathy / llm.c
LLM training in simple, raw C/CUDA
☆30,671 · Jun 26, 2025 · Updated last year
thesephist / spectre
Sparse autoencoders for Contra text embedding models
☆25 · Apr 24, 2024 · Updated 2 years ago
better-mojo / learn-mojo-archived
learn mojo
☆14 · Sep 26, 2024 · Updated last year
siyan-sylvia-li / arxivParser
☆18 · Sep 21, 2023 · Updated 2 years ago
joshuacnf / Ctrl-G
☆116 · Jun 18, 2026 · Updated last month
idiap / sigma-gpt
σ-GPT: A New Approach to Autoregressive Models
☆77 · Aug 14, 2024 · Updated last year
cloneofsimo / ptx-tutorial-by-aislop
PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7)
☆66 · Mar 24, 2025 · Updated last year
muellerzr / smol-moe
☆25 · Oct 10, 2025 · Updated 9 months ago
ikawrakow / ik_llamafile
Distribute and run LLMs with a single file.
☆25 · May 13, 2025 · Updated last year
krabicezpapundeklu / lemon-parser
Lemon is an LALR(1) parser generator for C or C++.
☆17 · Jun 10, 2014 · Updated 12 years ago
shmup / redbean-calcpad
CalcPad served with redbean
☆15 · Aug 18, 2022 · Updated 3 years ago
unixpickle / learn-ptx
Learning about CUDA by writing PTX code.
☆160 · Feb 27, 2024 · Updated 2 years ago
InfrHQ / Replay
An Infr app that helps you replay & talk to everything you've ever seen.
☆15 · Sep 19, 2023 · Updated 2 years ago
joey00072 / Attention-as-graph
An alternative way of calculating self-attention
☆18 · May 25, 2024 · Updated 2 years ago
yoheinakajima / llm_vs_vector
Testing speed and cost of classification via LLM or via vector embeddings
☆21 · Aug 6, 2023 · Updated 2 years ago
woodrush / numsectorlisp
Fixed-point scalar and matrix multiplication library for SectorLISP
☆15 · Jan 23, 2022 · Updated 4 years ago
devflowinc / yc-companies
YC companies example built on Trieve
☆14 · Aug 1, 2024 · Updated last year
geohot / tinydreamer
An implementation of delta-iris in tinygrad
☆75 · Aug 19, 2024 · Updated last year
mumu12641 / strawberry
🍓 A toy object-oriented programming language written in Rust
☆17 · Apr 10, 2024 · Updated 2 years ago
KhawajaAbaid / micrograd_c
Andrej Karpathy's micrograd implemented in C
☆30 · Aug 7, 2024 · Updated last year
Bruce-Lee-LY / cuda_hgemv
Several optimization methods of half-precision general matrix vector multiplication (HGEMV) using CUDA core.
☆75 · Sep 8, 2024 · Updated last year
siboehm / SGEMM_CUDA
Fast CUDA matrix multiplication from scratch
☆1,267 · Sep 2, 2025 · Updated 10 months ago
salykova / sgemm.c
Multi-Threaded FP32 Matrix Multiplication on x86 CPUs
☆378 · Apr 21, 2025 · Updated last year
tom-pollak / claudette-pydantic
☆10 · Oct 22, 2024 · Updated last year
empjustine / redbean-zipfile
Serve assets in zipfiles inside or outside of Redbean
☆17 · Sep 1, 2022 · Updated 3 years ago
Snektron / gpumode-amd-fp8-mm
My submission for the GPUMODE/AMD fp8 mm challenge
☆29 · Jun 4, 2025 · Updated last year
omkaark / simple-federated-learning
☆96 · Apr 18, 2024 · Updated 2 years ago
jpe90 / hello-cosmo
A makefile project to demonstrate building a portable Hello World executable with Cosmopolitan Libc
☆14 · Sep 6, 2022 · Updated 3 years ago
ruphin / pybin
A python web application in a single binary
☆15 · Nov 14, 2023 · Updated 2 years ago
philipfabianek / ptx-playground
A simple environment for writing and experimenting with hand-written CUDA PTX kernels.
☆18 · Sep 11, 2025 · Updated 10 months ago