bclarkson-code / TricycleLinks

Autograd to GPT-2 completely from scratch

☆115

Alternatives and similar repositories for Tricycle

Users that are interested in Tricycle are comparing it to the libraries listed below

Sorting:

adamkarvonen / chess_llm_interpretability
Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …
☆209Updated 8 months ago
idoh / mamba.np
A pure NumPy implementation of Mamba.
☆223Updated last year
valine / training-hot-swap
Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆126Updated 3 months ago
joennlae / tensorli
Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).
☆253Updated last year
valine / NeuralFlow
Visualize the intermediate output of Mistral 7B
☆367Updated 6 months ago
mlecauchois / micrograd-cuda
☆249Updated last year
sumo43 / loopvlm
run paligemma in real time
☆131Updated last year
neurallambda / awesome-reasoning
a curated list of data for reasoning ai
☆137Updated last year
umuthopeyildirim / DOOM-Mistral
Mistral7B playing DOOM
☆133Updated last year
Cerebras / gigaGPT
a small code base for training large models
☆308Updated 3 months ago
joey00072 / Tinytorch
A really tiny autograd engine
☆95Updated 2 months ago
ScalingIntelligence / tokasaurus
☆388Updated last week
saurabhaloneai / Llama-3-From-Scratch-In-Pure-Jax
This repository contain the simple llama3 implementation in pure jax.
☆68Updated 5 months ago
vgel / logitloom
explore token trajectory trees on instruct and base models
☆134Updated 2 months ago
PaulPauls / llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…
☆622Updated 4 months ago
rentruewang / bocoel
Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…
☆286Updated this week
Futrell / ziplm
☆252Updated 2 years ago
kroggen / mamba.c
Inference of Mamba models in pure C
☆189Updated last year
revalo / tree-diffusion
Diffusion on syntax trees for program synthesis
☆470Updated last year
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆138Updated last year
llmonpy / needle-in-a-needlestack
☆116Updated 6 months ago
kolinko / effort
An implementation of bucketMul LLM inference
☆221Updated last year
a1k0n / a1gpt
throwaway GPT inference
☆140Updated last year
samvher / bert-for-laptops
A BERT that you can train on a (gaming) laptop.
☆209Updated last year
naklecha / llm-inference-optimizations-explained
in this repository, i'm going to implement increasingly complex llm inference optimizations
☆64Updated 2 months ago
wbrickner / noise_step
noise_step: Training in 1.58b With No Gradient Memory
☆220Updated 7 months ago
babycommando / neuralgraffiti
Live-bending a foundation model’s output at neural network level.
☆266Updated 4 months ago
facebookresearch / searchformer
Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".
☆372Updated last year
FlorianDietz / comgra
A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different…
☆288Updated 8 months ago
codelion / pts
Pivotal Token Search
☆119Updated 3 weeks ago