Implement llama3 inference step by step: grasp the core concepts, master the derivation of the process, and write the code.
☆628 Feb 24, 2025 Updated last year
Alternatives and similar repositories for Deepdive-llama3-from-scratch
Users that are interested in Deepdive-llama3-from-scratch are comparing it to the libraries listed below.
- ☆1,466 Feb 15, 2025 Updated last year
- Animating R1's thoughts. ☆382 Feb 17, 2025 Updated last year
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch ☆699 Jun 14, 2025 Updated 9 months ago
- Migrate from Docker to Podman. ☆384 Apr 2, 2025 Updated 11 months ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆632 Mar 23, 2025 Updated last year
- Docker-based inference engine for AMD GPUs ☆233 Oct 7, 2024 Updated last year
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit … ☆362 May 21, 2025 Updated 10 months ago
- A browser-based, WebGL2 implementation of GPT-2 with transform block and attention matrix visualization ☆343 Oct 24, 2025 Updated 5 months ago
- Run and explore Llama models locally with minimal dependencies on CPU ☆188 Oct 12, 2024 Updated last year
- Deep Reinforcement Learning: Zero to Hero! ☆2,268 Oct 27, 2025 Updated 5 months ago
- See Through Your Models ☆400 Jul 8, 2025 Updated 8 months ago
- Examples and guides for using the VLM Run API ☆309 Jan 27, 2026 Updated 2 months ago
- ☆10 Feb 14, 2025 Updated last year
- Things you can do with the token embeddings of an LLM ☆1,453 Dec 1, 2025 Updated 3 months ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig… ☆227 Dec 24, 2024 Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspective ☆1,232 Aug 27, 2025 Updated 7 months ago
- LLM Analytics ☆708 Oct 19, 2024 Updated last year
- A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen. ☆4,029 Updated this week
- Minimal LLM inference in Rust ☆1,035 Oct 24, 2024 Updated last year
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams) ☆685 May 20, 2025 Updated 10 months ago
- Personal Site ☆20 Jan 11, 2026 Updated 2 months ago
- llama3 implementation one matrix multiplication at a time ☆15,255 May 23, 2024 Updated last year
- Neurox control helm chart details ☆30 Apr 29, 2025 Updated 10 months ago
- Fully neural approach for text chunking ☆407 Oct 23, 2025 Updated 5 months ago
- ☆200 May 5, 2025 Updated 10 months ago
- A TypeScript library to create platform-agnostic applications ☆71 Mar 15, 2026 Updated last week
- Companion code for 《汇编语言一发入魂》 ☆15 May 30, 2020 Updated 5 years ago
- ☆48 Apr 2, 2025 Updated 11 months ago
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat… ☆449 Nov 24, 2025 Updated 4 months ago
- Create mind maps to learn new things using AI. ☆571 Nov 2, 2024 Updated last year
- High-Performance Implementation of OpenAI's TikToken. ☆473 Jul 3, 2025 Updated 8 months ago
- Dead Simple LLM Abliteration ☆257 Mar 2, 2026 Updated 3 weeks ago
- Live-bending a foundation model’s output at neural network level. ☆274 Apr 7, 2025 Updated 11 months ago
- From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :) ☆796 Oct 30, 2024 Updated last year
- A hub for various industry-specific schemas to be used with VLMs. ☆541 Dec 15, 2025 Updated 3 months ago
- Have a natural, spoken conversation with AI! ☆3,580 Jul 11, 2025 Updated 8 months ago
- Mentra Smart Glasses Hackathon - sheet music in AR ☆70 Apr 27, 2025 Updated 10 months ago
- My personal standard for how to set up a Javascript workspace ☆15 Jul 16, 2023 Updated 2 years ago
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,795 Apr 18, 2025 Updated 11 months ago