gi0nyx / GPT-Scratch

This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames

☆24

Alternatives and similar repositories for GPT-Scratch

Users who are interested in GPT-Scratch are comparing it to the repositories listed below.

sankalp1999 / semantweet-search
Vector search over tweets from the tweet archive using OpenAI embeddings and LanceDB
☆54 Updated last year
Nearcyan / papers.day
papers.day
☆91 Updated last year
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66 Updated 11 months ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆138 Updated last year
knowrohit / know_medical_dialogues
KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…
☆24 Updated last year
teknium1 / ShareGPT-Builder
☆115 Updated 7 months ago
omkaark / simple-federated-learning
☆96 Updated last year
yacineMTB / just-large-models
Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.
☆44 Updated last year
teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆30 Updated 2 years ago
joey00072 / Tinytorch
A really tiny autograd engine
☆95 Updated 2 months ago
macrocosmcorp / macrocosm-terminal
My name is Ozymandias, King of Kings; Look on my Works, ye Mighty, and despair!
☆40 Updated last year
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆62 Updated 9 months ago
hitorilabs / navi
compute, storage, and networking infra at home
☆65 Updated last year
ishan0102 / rsrch.space
Stream of my favorite papers and links
☆42 Updated 4 months ago
samefarrar / entropix_mlx
Modify Entropy Based Sampling to work with Mac Silicon via MLX
☆49 Updated 9 months ago
naklecha / factorio-automation
i will automate factorio
☆108 Updated last year
spikedoanz / from-bits-to-intelligence
could we make an ml stack in 100,000 lines of code?
☆46 Updated last year
qrsch / doubutsu
☆24 Updated last year
smolorg / smolvecstore
a tiny vectorstore implementation built with numpy.
☆62 Updated last year
NousResearch / forge-api-demo
Simple demo showing how to use the Forge API by Nous Research
☆12 Updated 8 months ago
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆101 Updated last year
snowclipsed / moondream-zig
moondream in zig.
☆73 Updated 2 months ago
thesephist / spectre
Sparse autoencoders for Contra text embedding models
☆25 Updated last year
Laz4rz / GPT-2
Following master Karpathy with a GPT-2 implementation and training, writing lots of comments because I have the memory of a goldfish
☆172 Updated last year
Figura-Labs-Inc / telegraf_nv_export
Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.
☆62 Updated last year
NeoVertex1 / ComplexTensor
ComplexTensor: Machine Learning By Bridging Classical and Quantum Computation
☆77 Updated 8 months ago
teknium1 / transformers-gptq-quant
☆47 Updated last year
doomslide / hyperobject
Plotting (entropy, varentropy) for small LMs
☆98 Updated 2 months ago
AblateIt / finetune-study
Comprehensive analysis of the difference in performance between QLoRA, LoRA, and full fine-tunes.
☆82 Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103 Updated 5 months ago