Antimatter543 / karpathy-NN-lecturesLinks

My runthrough of karpathy's lectures (with notes), building NN's from scratch, simple autoregressive language models, GPT models and learnt ML techniques.

☆11

Alternatives and similar repositories for karpathy-NN-lectures

Users that are interested in karpathy-NN-lectures are comparing it to the libraries listed below

Sorting:

teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆30Updated 2 years ago
saurabhaloneai / Llama-3-From-Scratch-In-Pure-Jax
This repository contain the simple llama3 implementation in pure jax.
☆64Updated 3 months ago
RyanLucas3 / poasterGPT
A single notebook for fine-tuning GPT-3.5 turbo
☆32Updated 9 months ago
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆81Updated last year
knowrohit / know_medical_dialogues
KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…
☆24Updated last year
gi0nyx / GPT-Scratch
This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames
☆24Updated last year
yacineMTB / just-large-models
Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.
☆44Updated last year
andrew-silva / mlx-rlhf
An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.
☆29Updated 11 months ago
abacaj / transformers-docker
Run, build, test transformer models using docker
☆32Updated 2 years ago
qrsch / doubutsu
☆23Updated 10 months ago
jaymody / picoBERT
Like picoGPT but for BERT.
☆49Updated 2 years ago
joey00072 / ohara
Collection of autoregressive model implementation
☆85Updated last month
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆137Updated 11 months ago
1rgs / token-trekker-rs
☆13Updated 2 years ago
teknium1 / transformers-gptq-quant
☆48Updated last year
teknium1 / LLM-Logbook
Public reports detailing responses to sets of prompts by Large Language Models.
☆30Updated 5 months ago
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆76Updated last month
CarperAI / treasure_trove
☆22Updated last year
sfcompute / tinynarrations
A synthetic story narration dataset to study small audio LMs.
☆31Updated last year
OptimalFoundation / nadir
Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻
☆14Updated 11 months ago
saurabhaloneai / image-cap
image captioninggg🐳
☆11Updated 9 months ago
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆59Updated last year
omkaark / simple-federated-learning
☆98Updated last year
saurabhaloneai / the-tale-of-llm-and-vlms
in depth exploration of llm and vlms.(notes)
☆11Updated 8 months ago
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆100Updated last year
joey00072 / Tinytorch
A really tiny autograd engine
☆94Updated last week
attentionmech / smolbox
smolbox of recipies
☆28Updated last month
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆67Updated 9 months ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆78Updated 5 months ago
pacman100 / openhathi_instruct
This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…
☆23Updated last year