therealoliver / Deepdive-llama3-from-scratch
Implement Llama 3 inference step by step: grasp the core concepts, master the derivation, and write the code.
☆609 Updated 6 months ago
Alternatives and similar repositories for Deepdive-llama3-from-scratch
Users who are interested in Deepdive-llama3-from-scratch are comparing it to the repositories listed below
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch ☆670 Updated 3 months ago
- Run and explore Llama models locally with minimal dependencies on CPU ☆190 Updated 11 months ago
- Animating R1's thoughts. ☆384 Updated 7 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆622 Updated 5 months ago
- Examples and guides for using the VLM Run API ☆293 Updated 2 months ago
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks ☆200 Updated 2 months ago
- High-Performance Implementation of OpenAI's TikToken. ☆451 Updated 2 months ago
- Multimodal RAG to search and interact locally with technical documents of any kind ☆252 Updated last month
- ☆248 Updated last year
- ☆196 Updated 4 months ago
- ☆1,431 Updated 7 months ago
- R.L. methods and techniques. ☆199 Updated 10 months ago
- LLM Analytics ☆677 Updated 10 months ago
- Docker-based inference engine for AMD GPUs ☆231 Updated 11 months ago
- A hub for various industry-specific schemas to be used with VLMs. ☆533 Updated 3 months ago
- ☆417 Updated 3 weeks ago
- See Through Your Models ☆400 Updated 2 months ago
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit … ☆359 Updated 3 months ago
- Fully neural approach for text chunking ☆370 Updated 4 months ago
- Transcribe PDFs with local LLMs ☆670 Updated 2 weeks ago
- OpenCV+YOLO+LLAVA powered video surveillance system ☆774 Updated 2 weeks ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines). ☆253 Updated last year
- Browser-LLM Auto-Scaling Technology ☆548 Updated 3 weeks ago
- ☆189 Updated last year
- A concise text on quantum mechanics, intended for a general mathematical audience including CS, engineering, math, and physics undergrads… ☆144 Updated last week
- CleverBee - The Open Source Deep Researcher Tool ☆307 Updated 3 months ago
- Generate Cool-Looking Mazes and Animations Illustrating the A* Pathfinding Algorithm ☆177 Updated 6 months ago
- Ultra-lightweight AI Agent ☆387 Updated 3 weeks ago
- A BERT that you can train on a (gaming) laptop. ☆209 Updated 2 years ago
- ☆279 Updated 3 months ago