Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
☆629Feb 24, 2025Updated last year
Alternatives and similar repositories for Deepdive-llama3-from-scratch
Users that are interested in Deepdive-llama3-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆1,472Feb 15, 2025Updated last year
- Animating R1's thoughts.☆380Feb 17, 2025Updated last year
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch☆704Jun 14, 2025Updated last year
- Migrate from Docker to Podman.☆386Apr 2, 2025Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆637Mar 23, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Docker-based inference engine for AMD GPUs☆233Oct 7, 2024Updated last year
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …☆361May 21, 2025Updated last year
- A browser-based, WebGL2 implementation of GPT-2 with transform block and attention matrix visualization☆341Oct 24, 2025Updated 7 months ago
- Run and explore Llama models locally with minimal dependencies on CPU☆188Oct 12, 2024Updated last year
- Deep Reinforcement Learning: Zero to Hero!☆2,288May 26, 2026Updated 2 weeks ago
- See Through Your Models☆403Jul 8, 2025Updated 11 months ago
- Examples and guides for using the VLM Run API☆309Apr 10, 2026Updated 2 months ago
- ☆10Feb 14, 2025Updated last year
- Things you can do with the token embeddings of an LLM☆1,451Dec 1, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆227Dec 24, 2024Updated last year
- LLM Analytics☆714Oct 19, 2024Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,261Aug 27, 2025Updated 9 months ago
- A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.☆4,270Updated this week
- Minimal LLM inference in Rust☆1,034Oct 24, 2024Updated last year
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)☆682May 13, 2026Updated last month
- Personal Site☆20May 23, 2026Updated 3 weeks ago
- Neurox control helm chart details☆30Apr 29, 2025Updated last year
- llama3 implementation one matrix multiplication at a time☆15,230May 23, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Fully neural approach for text chunking☆410Oct 23, 2025Updated 7 months ago
- ☆198May 5, 2025Updated last year
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.☆24Nov 26, 2025Updated 6 months ago
- A TypeScript library to create platform-agnostic applications☆72Jun 6, 2026Updated last week
- 《汇编语言一发入魂》配套代码☆15May 30, 2020Updated 6 years ago
- World's first Nintendo 3DS emulator for Apple devices based on Citra.☆18Apr 7, 2023Updated 3 years ago
- ☆48Apr 2, 2025Updated last year
- Create mind maps to learn new things using AI.☆571Nov 2, 2024Updated last year
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat…☆450Nov 24, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- High-Performance Implementation of OpenAI's TikToken.☆475Jul 3, 2025Updated 11 months ago
- Dead Simple LLM Abliteration☆269Mar 2, 2026Updated 3 months ago
- Live-bending a foundation model’s output at neural network level.☆273Apr 7, 2025Updated last year
- A hub for various industry-specific schemas to be used with VLMs.☆548Dec 15, 2025Updated 6 months ago
- Mentra Smart Glasses Hackathon - sheet music in AR☆71Apr 27, 2025Updated last year
- Have a natural, spoken conversation with AI!☆3,753Jul 11, 2025Updated 11 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,860Apr 18, 2025Updated last year