a1k0n / a1gptLinks

throwaway GPT inference

☆140

Alternatives and similar repositories for a1gpt

Users that are interested in a1gpt are comparing it to the libraries listed below

Sorting:

mlecauchois / micrograd-cuda
☆249Updated last year
robjinman / richard
Richard is gaining power
☆196Updated last month
wangyi-fudan / wyGPT
Wang Yi's GPT solution
☆142Updated last year
antirez / gguf-tools
GGUF implementation in C as a library and a tools CLI program
☆277Updated 6 months ago
trevorpogue / algebraic-nnhw
Algebraic enhancements for GEMM & AI accelerators
☆278Updated 5 months ago
salykova / sgemm.c
Multi-Threaded FP32 Matrix Multiplication on x86 CPUs
☆350Updated 3 months ago
joennlae / tensorli
Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).
☆253Updated last year
samvher / bert-for-laptops
A BERT that you can train on a (gaming) laptop.
☆209Updated last year
nirw4nna / dsc
Tensor library & inference framework for machine learning
☆106Updated 3 weeks ago
valine / training-hot-swap
Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆126Updated 3 months ago
Futrell / ziplm
☆252Updated 2 years ago
Maknee / minigpt4.cpp
Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)
☆568Updated last year
adamkarvonen / chess_llm_interpretability
Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …
☆208Updated 8 months ago
dicroce / hnsw
Heirarchical Navigable Small Worlds
☆98Updated 3 months ago
maxilevi / raytracer
C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or in the GPU via CUDA.
☆76Updated 2 years ago
kolinko / effort
An implementation of bucketMul LLM inference
☆221Updated last year
DiscoGrad / DiscoGrad
DiscoGrad - automatically differentiate across conditional branches in C++ programs
☆204Updated 10 months ago
AMD-AIG-AIMA / AMD-LLM
☆188Updated 11 months ago
kroggen / mamba.c
Inference of Mamba models in pure C
☆189Updated last year
Cerebras / gigaGPT
a small code base for training large models
☆307Updated 3 months ago
jart / morton
☆294Updated last year
moonshine-ai / qc_npu_benchmark
Code sample showing how to run and benchmark models on Qualcomm's Window PCs
☆100Updated 10 months ago
joennlae / halutmatmul
Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator
☆211Updated last year
symisc / tiny-dream
Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation
☆264Updated last year
bclarkson-code / Tricycle
Autograd to GPT-2 completely from scratch
☆115Updated 3 months ago
valine / NeuralFlow
Visualize the intermediate output of Mistral 7B
☆367Updated 6 months ago
idoh / mamba.np
A pure NumPy implementation of Mamba.
☆223Updated last year
umuthopeyildirim / DOOM-Mistral
Mistral7B playing DOOM
☆133Updated last year
jostmey / NakedAttention
Revealing example of self-attention, the building block of transformer AI models
☆131Updated 2 years ago
Foreseerr / TScale
☆196Updated 3 months ago