therealoliver / Deepdive-llama3-from-scratchLinks

Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.

☆592

Alternatives and similar repositories for Deepdive-llama3-from-scratch

Users that are interested in Deepdive-llama3-from-scratch are comparing it to the libraries listed below

Sorting:

yousef-rafat / miniDiffusion
A reimplementation of Stable Diffusion 3.5 in pure PyTorch
☆576Updated last week
PaulPauls / llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…
☆616Updated 3 months ago
joennlae / tensorli
Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).
☆252Updated last year
dhealy05 / frames_of_mind
Animating R1's thoughts.
☆382Updated 4 months ago
mlecauchois / micrograd-cuda
☆248Updated last year
anordin95 / run-llama-locally
Run and explore Llama models locally with minimal dependencies on CPU
☆191Updated 8 months ago
rentruewang / bocoel
Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…
☆285Updated 3 weeks ago
mirth / chonky
Fully neural approach for text chunking
☆357Updated last month
slashml / amd_inference
Docker-based inference engine for AMD GPUs
☆231Updated 8 months ago
vlm-run / vlmrun-cookbook
Examples and guides for using the VLM Run API
☆279Updated 3 weeks ago
vlm-run / vlmrun-hub
A hub for various industry-specific schemas to be used with VLMs.
☆518Updated 3 weeks ago
labmlai / inspectus
LLM Analytics
☆668Updated 8 months ago
samvher / bert-for-laptops
A BERT that you can train on a (gaming) laptop.
☆209Updated last year
PsyChip / machina
OpenCV+YOLO+LLAVA powered video surveillance system
☆763Updated 2 weeks ago
Brandon-c-tech / RAG-logger
RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…
☆220Updated 6 months ago
Foreseerr / TScale
☆196Updated last month
Om-Alve / smolGPT
☆1,405Updated 4 months ago
Z-Gort / Reservoirs-Lab
☆279Updated 2 weeks ago
punnerud / Local_Knowledge_Graph
☆444Updated 9 months ago
stanford-mast / blast
Browser-LLM Auto-Scaling Technology
☆524Updated this week
ivanbelenky / RL
R.L. methods and techniques.
☆190Updated 7 months ago
okuvshynov / slowllama
Finetune llama2-70b and codellama on MacBook Air without quantization
☆447Updated last year
felafax / felafax
Felafax is building AI infra for non-NVIDIA GPUs
☆562Updated 5 months ago
babycommando / neuralgraffiti
Live-bending a foundation model’s output at neural network level.
☆259Updated 2 months ago
taylorai / aiq
ai for jq
☆242Updated 9 months ago
valine / training-hot-swap
Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆126Updated 2 months ago
dleemiller / WordLlama
Things you can do with the token embeddings of an LLM
☆1,445Updated 2 months ago
Dicklesworthstone / visual_astar_python
Generate Cool-Looking Mazes and Animations Illustrating the A* Pathfinding Algorithm
☆177Updated 3 months ago
arc53 / llm-price-compass
This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …
☆221Updated 6 months ago
natolambert / rlhf-book
Textbook on reinforcement learning from human feedback
☆1,052Updated this week